(54) Translating apparatus and translating method

(57) A translating apparatus and a translating method wherein a first language sentence is divided (4)
into syntax units consisting of predetermined units of sentence structure, such as clauses and phrases,
in stages from large syntax units into small syntax units; at each stage, stored (3, 5) examples most
similar to these syntax units are detected (6) using probability (11) models taking into account
grammatical attributes of the syntax units and of the examples, using generalized linguistic knowledge
and with reference to a thesaurus (12); the syntax units are translated (7) on the basis of these
detected examples; and the results of translation of the syntax units are compounded to generate a
second language sentence. The invention makes it possible to carry out high-quality translating
efficiently, and therefore at a high processing speed, while eliminating the need for many superficially
different examples having the same meaning content to be prepared and the need for the thesaurus (12)
to be regularly structured or have uniform distances between concepts.
Description

Background of the Invention 

1. Field of the Invention

This invention relates to a translating apparatus and a translating method, and particularly to a
translating apparatus and a translating method for translating a first language sentence expressed in a
first language into a second language sentence expressed in a second language using examples.

2. Description of the Related Art 

Translating apparatuses for translating a first language sentence expressed in a first language into a
second language sentence expressed in a second language can be generally divided into three classes:
rule-driven translation apparatuses, example-using translation apparatuses, and example-driven
translation apparatuses.

Fig. 1 shows the construction of an example of a rule-driven translation apparatus. In Fig. 1, an
inputting part 1 consists of, for example, a keyboard, a voice recognition device or a character
recognition device; a first language sentence is inputted into it and is converted into a form such as
text data and outputted. That is, when the inputting part 1 consists of a keyboard, a first language
sentence is inputted by the keyboard being operated, and text data corresponding to that operation is
outputted. When the inputting part 1 is a voice recognition device, a first language sentence is
inputted as voice and voice-recognized. Text data corresponding to the results of the voice recognition
is then outputted. When the inputting part 1 is a character recognition device, a first language
sentence written for example on paper is inputted (read) and this first language sentence is
character-recognized. Text data corresponding to the results of this character recognition is then
outputted.

The text data corresponding to the first language sentence outputted from the inputting part 1 is
supplied to an analyzing part 31. In the analyzing part 31, the first language sentence from the
inputting part 1 is language-processed (analyzed) on the basis of knowledge and rules relating to the
first language, and the results of this analysis are supplied to a converting part 32. The converting
part 32 converts the analysis results from the analyzing part 31 into an intermediate language sentence
of a prescribed intermediate language on the basis of knowledge and rules relating to the first
language and a second language, and outputs this to a generating part 33. The generating part 33
generates from the intermediate language sentence supplied from the converting part 32 a corresponding
second language sentence, that is, a translation (second language sentence) consisting of the first
language sentence translated into the second language, on the basis of knowledge and rules relating to
the second language.

An outputting part 16 is made up of, for example, a display, or a voice synthesizing device and a
speaker or the like, and displays or outputs in a synthesized voice the second language sentence
supplied from the generating part 33.

Fig. 2 shows the construction of an example of an example-using translating apparatus. Parts in the
figure corresponding to parts in Fig. 1 have been given the same reference numerals and will not be
described in the following. Apart from being provided with a collating part 41 and a replacing part 42,
this example-using translating apparatus is of the same construction as the rule-driven translation
apparatus of Fig. 1.

In this example-using translating apparatus, examples in sentence units expressed in a first language
and corresponding translations consisting of the examples translated into a second language
(hereinafter for convenience these examples and their corresponding translations will be called
translation example data) are stored, for example in the form of parameters, and in the collating part
41 the first language sentence outputted by the inputting part 1 is collated with the examples and any
example matching the first language sentence is thereby detected. When there is an example which
matches the first language sentence, the collating part 41 controls the replacing part 42 to replace
the first language sentence with the translation corresponding to the example matching it.
Accordingly, in the replacing part 42, the first language sentence is replaced with the translation
corresponding to the example matching it, and this is supplied to the outputting part 16.

When on the other hand there is no example which matches the first language sentence, the collating
part 41 outputs the first language sentence to the analyzing part 31. Thereafter, in the analyzing part
31, the converting part 32 and the generating part 33, the same processing as in the case shown in
Fig. 1 is carried out, and the second language sentence obtained as a result is supplied to the
outputting part 16.
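The control flow of the example-using apparatus of Fig. 2 can be sketched as an exact-match lookup with
a fallback. This is only an illustrative sketch: the function names, the sample example pair and the
stub standing in for the analyzing, converting and generating parts 31 to 33 are all assumptions made
here, not details taken from the specification.

```python
from typing import Callable, Dict


def translate_example_using(
    sentence: str,
    examples: Dict[str, str],
    rule_driven: Callable[[str], str],
) -> str:
    """Return the stored translation on an exact match, else fall back.

    The lookup plays the role of the collating part 41 and replacing part 42;
    `rule_driven` stands in for parts 31 to 33 (hypothetical callable).
    """
    translation = examples.get(sentence)
    if translation is not None:
        return translation
    return rule_driven(sentence)


# Hypothetical translation example data (first language -> second language).
EXAMPLES = {"OHAYOU GOZAIMASU": "Good morning"}


def rule_driven_stub(sentence: str) -> str:
    # Placeholder only: a real rule-driven path would analyze, convert and generate.
    return f"<rule-driven translation of {sentence!r}>"


if __name__ == "__main__":
    print(translate_example_using("OHAYOU GOZAIMASU", EXAMPLES, rule_driven_stub))
    print(translate_example_using("WATASHI WA MIKAN O TABETAI", EXAMPLES, rule_driven_stub))
```

Because the lookup is keyed on the whole character string, any input that differs from a stored example
by even one word takes the fallback path, which is the limitation discussed for this class of apparatus.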

Details of this kind of example-using translating apparatus are disclosed for example in Japanese
Unexamined Patent Publication No. H.6-68134.

Next, Fig. 3 shows the construction of an example of an example-driven translating apparatus. Parts in
Fig. 3 the same as parts in Fig. 1 have been given the same reference numerals and will not be
described in the following.

In this example-driven translating apparatus, a first language sentence outputted by the inputting part
1 is supplied to a converting part 51, and when the converting part 51 receives the first language
sentence from the inputting part 1 it controls a searching part 52 to search for the example most
similar to that first language sentence.

That is, translation example data is stored in sentence units in a translation example memory 53, and
the searching part 52 first refers to the translation example memory 53 and searches for an example
which matches the first language sentence. When it finds an example which matches the first language
sentence, the searching part 52 outputs this example and its corresponding translation to the
converting part 51. In this case, the converting part 51 supplies the translation from the searching
part 52 unchanged to the outputting part 16 as a second language sentence.

When on the other hand it cannot find an example which matches the first language sentence, the
searching part 52 successively reads the examples stored in the translation example memory 53 and
supplies them to a similarity degree calculating part 54. The searching part 52 makes the similarity
degree calculating part 54 calculate a similarity degree expressing the conceptual similarity (the
similarity in meaning) between each of the examples and the first language sentence using external
knowledge such as, for example, a thesaurus.

That is, a thesaurus wherein words are classified on the basis of their concepts in a tree structure is
stored in a thesaurus memory part 55. In the thesaurus, nodes of the tree structure are equivalent to
meaning concepts and so-called leaf parts are equivalent to words. Referring to this thesaurus, the
similarity degree calculating part 54 calculates a degree of similarity between the first language
sentence and the examples on the basis of the classes to which concepts common to words constituting
the first language sentence and words constituting the examples belong. The searching part 52 then
finds in the translation example memory 53 the example of which the similarity degree calculated by the
similarity degree calculating part 54 is the highest and supplies the translation corresponding to that
example to the converting part 51.

When the converting part 51 receives the translation from the searching part 52, it replaces those
words of the translation which do not match (correspond with) words of the first language sentence with
translations of those words and outputs this to the outputting part 16 as a second language sentence.

Details of this kind of example-driven translating apparatus are disclosed for example in Japanese
Unexamined Patent Publication No. H.3-276367. Details of methods of calculating the degree of
similarity between a first language sentence and an example are also disclosed in, for example,
Japanese Unexamined Patent Publication No. H.4-188276, Japanese Unexamined Patent Publication No.
H.6-274546 and Japanese Unexamined Patent Publication No. H.6-274548, as well as the aforementioned
Japanese Unexamined Patent Publication No. H.3-276367.

However, in the kinds of translating apparatus described above, there have been the following problems.

That is, in rule-driven translation apparatuses there has been the problem that information required
for translation is sometimes lost in the process of analysis of the first language sentence, and
consequently results of translation of greetings and other set phrases have become word-for-word in
tone. Also, when the first language sentence includes, for example, proverbs or other idiomatic
expressions (idioms), translating these correctly has been difficult. Also, building and maintaining
as a data base the knowledge and rules relating to the first language and the second language used in
rule-driven translation apparatuses has not been easy.

With example-using translating apparatuses, on the other hand, because the first language sentence is
translated using a translation corresponding to an example matching it, if greetings and other set
phrases and idioms and so on are stored as examples, it is possible to obtain a second language
sentence which is a natural expression. However, in an example-using translating apparatus, when no
example matching the first language sentence is stored, because processing similar to that of a
rule-driven translation apparatus is carried out after all, the kinds of problem mentioned above arise,
and furthermore storing examples completely matching the character strings of all first language
sentences that might be inputted has been problematic.

In an example-driven translating apparatus, because translation is carried out using examples whatever
the nature of the first language sentence, the problems of the rule-driven translation apparatus do not
arise, and also it is not necessary to store examples completely matching all possible first language
sentences.

However, in an example-driven translating apparatus, when there is no example matching the first
language sentence, the example most similar to it is found and those of the words constituting the
corresponding translation which do not match words constituting the first language sentence are simply
replaced with translations of those words, and this becomes the translation result, i.e. the second
language sentence. Therefore, in an example-driven translating apparatus, linguistic knowledge and
rules are not reflected in a generalized form, and consequently the quality of the translation (the
accuracy of the translation and the naturalness of the translation result) can only be increased by
increasing the number of examples stored in the translation example memory 53 (whereas the quality of
the translations of a rule-driven translation apparatus can also be improved through the way in which
knowledge and rules relating to the languages are described). Increasing the number of examples stored
in the translation example memory 53 increases the time required for the processing of searching for an
example matching or similar to the first language sentence.

The calculation of the similarity degree D(I,E) between a first language sentence I and an example
phrase E in an example-driven translating apparatus, when the words constituting the first language
sentence I are expressed i_1, i_2, ..., i_t and the words constituting the example phrase E are
expressed e_1, e_2, ..., e_t (where t represents the number of words constituting the first language
sentence and the example respectively), has been carried out as shown in Exp. (1) by finding the
distance in meaning (the conceptual distance) word-distance(i_k, e_k) between each word i_k
constituting the first language sentence I and the corresponding word e_k in the example phrase E,
assigning to it a weight weight_k corresponding to the importance of the word e_k, and obtaining the
sum total of these weighted distances.



$$D(I,E) = \sum_{k=1}^{t} \text{word-distance}(i_k, e_k) \times \text{weight}_k \qquad (1)$$
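As a rough illustration of Exp. (1), the weighted sum can be written as below. The function name, the
toy word distance (0 for identical words, 1 otherwise) and the uniform weights are assumptions chosen
only to make the sketch runnable; the related art obtains the word distance from a thesaurus instead.

```python
from typing import Callable, Sequence


def similarity_degree(
    input_words: Sequence[str],
    example_words: Sequence[str],
    weights: Sequence[float],
    word_distance: Callable[[str, str], float],
) -> float:
    """Sum of word_distance(i_k, e_k) * weight_k over k = 1..t, as in Exp. (1)."""
    if not (len(input_words) == len(example_words) == len(weights)):
        # Exp. (1) presupposes the same number of words t on both sides.
        raise ValueError("input, example and weights must all have length t")
    return sum(
        word_distance(i_k, e_k) * w_k
        for i_k, e_k, w_k in zip(input_words, example_words, weights)
    )


def toy_distance(a: str, b: str) -> float:
    # Hypothetical stand-in for a thesaurus-based conceptual distance.
    return 0.0 if a == b else 1.0


if __name__ == "__main__":
    sentence = ["WATASHI", "WA", "RINGO", "O", "TABETAI"]
    example = ["WATASHI", "WA", "MIKAN", "O", "TABETAI"]
    print(similarity_degree(sentence, example, [1.0] * 5, toy_distance))
```

The explicit length check makes visible the presupposition that both word sequences have the same
length t and are compared position by position.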



With this kind of similarity degree calculating method there have been the following kinds of problem.

That is, for example when the meaning content of a certain example is spoken, although sometimes the
example itself is spoken, in most cases a word such as the word constituting the subject in the example
is omitted or extra words are inserted. With the calculation method described above, because it is
presupposed that the first language sentence and the example are made up of the same number of words,
when the numbers of words in the two are different, it has been unclear whether or not it is possible
to obtain a similarity degree correctly reflecting the similarity of the example to the first language
sentence.

Also, conversely, even if the first language sentence and the example are made up of the same number of
words, when their word orders are different it has again been unclear whether or not it is possible to
obtain a similarity degree correctly reflecting the similarity of the example to the first language
sentence. That is, if the first language is a language wherein words can be lined up in a relatively
free order, like for example Japanese, when for example 'RINGO O WATASHI WA TABETAI' ('an apple - I
want to eat') is inputted as the first language sentence, even if for example 'WATASHI WA MIKAN O
TABETAI' ('I want to eat an orange') is stored as an example, because the 'RINGO' ('apple') and the
'WATASHI' ('I') of the first language sentence respectively correspond to the 'WATASHI' ('I') and the
'MIKAN' ('orange') of the example, recognizing that the two sentences are similar has been difficult.

Also, when for example the inputting part 1 consists of a voice recognition device, it sometimes
happens that the speaker does not speak clearly and correct recognition is not achieved in the voice
recognition device, and text data wherein particles and the like are missing is outputted. That is, it
sometimes happens for example that although the first language sentence intended by the speaker was
'WATASHI WA, ANATA NO MIKAN GA TABETAI' ('I want to eat your orange'), the output of the inputting
part 1 becomes 'WATASHI, ANATA, MIKAN, TABETAI' ('I, you, orange, want to eat'). In this kind of case,
when for example 'WATASHI WA MIKAN O TABETAI' ('I want to eat an orange') is stored as an example, the
similarity degree should be calculated with the compound noun 'ANATA NO MIKAN' ('your orange'),
consisting of the 'ANATA' and the 'MIKAN' in the output of the inputting part 1 joined together, being
made to correspond with the 'MIKAN' ('orange') in the example, but with a method using Exp. (1) this
has been difficult.

To handle an input whose meaning content matches that of an example but which differs from the example
as it were superficially, as described above, it is necessary to add examples corresponding to these
superficial variations to the translation example memory 53, and as mentioned above this increases the
time required for the searching process.

Also, the distance word-distance(i_k, e_k) between the word i_k and the word e_k has been determined on
the basis of the class to which a concept common to the word i_k and the word e_k (the concept which
belongs to the lowest class among the concepts including the word i_k and the word e_k) belongs in a
thesaurus stored in the thesaurus memory part 55. That is, the thesaurus is constructed by, for
example, making the class to which the largest concepts belong the 0th class and then classifying
smaller concepts into 1st through 3rd progressively lower classes and classifying words by assigning
them to the concepts in the 3rd class to which they correspond, and with reference to this kind of
thesaurus it is detected to which of the 0th through 3rd classes the lowest concept common to the word
i_k and the word e_k belongs (hereinafter for convenience the C in Cth class will be referred to as the
level). Then, according to whether that concept belongs to the 0th, the 1st, the 2nd or the 3rd class,
the distance word-distance(i_k, e_k) is determined to be 0, 1/3, 2/3 or 1 respectively.
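The level-based lookup described above might be sketched as follows, assuming each word is stored with
its path of concepts from the 0th class down to the 3rd class. The thesaurus contents, concept names
and function names are hypothetical. The level-to-value table copies the values stated in the text (0,
1/3, 2/3, 1 for levels 0 to 3) and is kept as a parameter so that a different table can be supplied.

```python
from fractions import Fraction
from typing import Dict, Sequence, Tuple

# Values for levels 0..3, copied from the text as stated.
LEVEL_TO_DISTANCE: Tuple[Fraction, ...] = (
    Fraction(0), Fraction(1, 3), Fraction(2, 3), Fraction(1),
)

# Hypothetical thesaurus: word -> concept path, 0th class first, 3rd class last.
THESAURUS: Dict[str, Tuple[str, ...]] = {
    "RINGO": ("thing", "food", "plant-food", "fruit"),
    "MIKAN": ("thing", "food", "plant-food", "fruit"),
    "PAN": ("thing", "food", "processed-food", "bread"),
    "WATASHI": ("thing", "person", "speaker", "first-person"),
}


def common_concept_level(
    word_a: str, word_b: str, thesaurus: Dict[str, Tuple[str, ...]]
) -> int:
    """Level of the lowest concept shared by both words (0 is the top class)."""
    level = -1
    for concept_a, concept_b in zip(thesaurus[word_a], thesaurus[word_b]):
        if concept_a != concept_b:
            break
        level += 1
    if level < 0:
        raise ValueError("words share no concept, not even in the 0th class")
    return level


def word_distance(
    word_a: str,
    word_b: str,
    thesaurus: Dict[str, Tuple[str, ...]] = THESAURUS,
    table: Sequence[Fraction] = LEVEL_TO_DISTANCE,
) -> Fraction:
    """Look up the value assigned to the level of the lowest common concept."""
    return table[common_concept_level(word_a, word_b, thesaurus)]


if __name__ == "__main__":
    print(word_distance("RINGO", "MIKAN"), word_distance("RINGO", "PAN"))
```

Indexing a fixed table by level only works if every word sits under the same number of classes, which
is the regular structure this method demands of the thesaurus.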

In this case, it is necessary that when the higher classes are seen from the words there are the same
number of classes (in the case described above, the four classes of 0th through 3rd) above each word,
and therefore the thesaurus as a whole has had to have an as it were regular structure.

Also, because the distance word-distance(i_k, e_k) between two words is determined on the basis of the
level of the class to which the common concept of the two words belongs (because it is determined on
the basis of which of the 0th through 3rd classes the class to which that concept belongs is), the
thesaurus must be so constructed that the distance between two words having a concept belonging to a
certain class as their common concept and the distance between two words having a different concept
belonging to the same class as their common concept is the same.

Fig. 4 shows an example of a conventional thesaurus wherein concepts of words are classified into three
classes, 0th through 2nd. In the figure, the rectangles represent concepts and the circles represent
words.

In this kind of thesaurus, the concept C1 common to the words Wa and Wb is different from the concept
C2 common to the words Wc and Wd, but since the concepts C1 and C2 both belong to the 1st class,
according to the method of calculating the distance word-distance(i_k, e_k) of the related art
described above, the distance between the words Wa and Wb and the distance between the words Wc and Wd
are the same. This means that the distance in meaning between concepts connected by lines is the same
between any two concepts, and therefore it has been necessary for the thesaurus to be so constructed
that this is the case.

However, constructing a thesaurus regularly and also so that the distance between concepts is the same,
as described above, although it may be possible with a small-scale thesaurus, is very difficult in a
thesaurus of any substantial size. That is, if the thesaurus is constructed regularly it is difficult
to make the distance between concepts fixed, and if the thesaurus is constructed so that the distance
between concepts is fixed it becomes difficult to maintain its regularity.

Therefore, a way of making possible high-quality translation using a non-regular thesaurus or a
thesaurus wherein the distance between concepts is not fixed has been sought.

Summary of the Invention

It is therefore an object of the present invention to overcome the problems described above and provide
a translating apparatus and a translating method with which it is possible to carry out high-quality
translation easily.

To achieve the aforementioned object and other objects, the invention provides a translating apparatus
for translating a first language sentence expressed in a first language into a second language sentence
expressed in a second language, comprising dividing means for dividing the first language sentence into
syntax units consisting of predetermined units of sentence structure, detecting means for detecting
examples similar to the syntax units of the first language sentence outputted from the dividing means,
translating means for translating the first language sentence in syntax units on the basis of
translations corresponding to the examples detected by the detecting means, and generating means for
generating a second language sentence by compounding the results of translation of the syntax units
obtained by the translating means.

The invention also provides a translating method comprising a step of dividing a first language
sentence into syntax units consisting of predetermined units of sentence structure, a step of detecting
examples similar to the syntax units of the first language sentence, a step of translating the first
language sentence in syntax units on the basis of translations corresponding to the detected examples,
and a step of generating a second language sentence by compounding the results of translation of those
syntax units.

The invention also provides a translating method comprising a step of dividing a first language
sentence into syntax units consisting of predetermined units of sentence structure in stages from large
syntax units to small syntax units and collating the syntax units obtained at each stage and examples,
a step of detecting examples corresponding to the syntax units on the basis of results of the
collation, a step of translating the first language sentence in syntax units on the basis of
translations corresponding to those detected examples, and a step of generating a second language
sentence by compounding the results of translation of those syntax units.

The invention also provides a translating apparatus comprising calculating means for calculating a word
similarity degree expressing the similarity between a word constituting a first language sentence and a
word constituting an example on the basis of the number of words included in a concept common to the
word constituting the first language sentence and the word constituting the example in a thesaurus
wherein words are classified in a tree structure on the basis of their concepts.

The invention also provides a translating method comprising a step of calculating a word similarity
degree expressing the similarity between a word constituting a first language sentence and a word
constituting an example on the basis of the number of words included in a concept common to the word
constituting the first language sentence and the word constituting the example in a thesaurus wherein
words are classified in a tree structure on the basis of their concepts.

The invention also provides a translating apparatus comprising detecting means for detecting an example
similar to a first language sentence on the basis of a priori probability of the example being used for
translating the first language sentence and a transformation probability of the example being
transformed into the first language sentence and used, and translating means for translating the first
language sentence on the basis of a translation corresponding to the example detected by the detecting
means.

The invention also provides a translating method comprising a step of detecting an example similar to a
first language sentence on the basis of a priori probability of the example being used for translating
the first language sentence and a transformation probability of the example being transformed into the
first language sentence and used, and a step of translating the first language sentence on the basis of
a translation corresponding to that detected example.
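One way to picture detection based on an a priori probability and a transformation probability is to
score each example by the product of the two and take the maximum. The product rule, the toy
word-overlap estimate of the transformation probability, the function names and the sample priors are
all assumptions made for this sketch; the passage above only states that detection is performed on the
basis of the two probabilities.

```python
from typing import Callable, Mapping


def detect_example(
    sentence: str,
    priors: Mapping[str, float],
    transformation_probability: Callable[[str, str], float],
) -> str:
    """Return the example E maximizing prior(E) * P(E is transformed into the sentence).

    Combining the two probabilities by multiplication is an assumption of this sketch.
    """
    return max(
        priors,
        key=lambda example: priors[example] * transformation_probability(example, sentence),
    )


def toy_transformation_probability(example: str, sentence: str) -> float:
    # Hypothetical stand-in: word-set overlap (Jaccard index), not a trained model.
    a, b = set(example.split()), set(sentence.split())
    return len(a & b) / len(a | b) if (a | b) else 0.0


# Hypothetical a priori probabilities of each stored example being used.
PRIORS = {
    "WATASHI WA MIKAN O TABETAI": 0.6,
    "KORE WA HON DESU": 0.4,
}

if __name__ == "__main__":
    print(detect_example("WATASHI WA RINGO O TABETAI", PRIORS, toy_transformation_probability))
```

Scoring every stored example this way is linear in the number of examples, so a practical system would
presumably restrict the candidates first.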

The invention also provides a translating apparatus comprising extracting means for extracting
grammatical attributes of a first language sentence by performing syntax analysis on the first language
sentence, detecting means for detecting an example similar to the first language sentence taking into
account grammatical attributes of the first language sentence and of the example, and translating means
for translating the first language sentence on the basis of a translation corresponding to the example
detected by the detecting means.

The invention also provides a translating method comprising a step of extracting grammatical attributes
of a first language sentence by performing syntax analysis on the first language sentence, a step of
detecting an example similar to the first language sentence taking into account grammatical attributes
of the first language sentence and of the example, and a step of translating the first language
sentence on the basis of a translation corresponding to the example detected by the detecting means.

Brief Description of the Drawings 

Fig. 1 is a block diagram showing the construction of an example of a translating apparatus of the related art;
Fig. 2 is a block diagram showing the construction of another example of a translating apparatus of the related art;
Fig. 3 is a block diagram showing the construction of another example of a translating apparatus of the related art;
Fig. 4 is a view showing a thesaurus used in a translating apparatus of the related art;
Fig. 5 is a block diagram showing an example of the construction of a preferred embodiment of a translating
apparatus according to the invention;
Fig. 6 is a view showing a thesaurus stored in a thesaurus memory 12 of Fig. 5;
Fig. 7 is a flow chart illustrating the operation of the translating apparatus of Fig. 5;
Figs. 8A and 8B are further views illustrating the operation of the translating apparatus of Fig. 5;
Figs. 9A through 9C are further views illustrating the operation of the translating apparatus of Fig. 5;
Figs. 10A through 10C are further views illustrating the operation of the translating apparatus of Fig. 5;
Fig. 11 is a flow chart illustrating a method for obtaining a priori probability P(Ei);
Fig. 12 is a flow chart illustrating processing carried out by a converting part 7 shown in Fig. 5;
Fig. 13 is a flow chart illustrating details of the processing of a step S32 in Fig. 12;
Fig. 14 is another flow chart illustrating details of the processing of the step S32 in Fig. 12;
Fig. 15 is a block diagram showing an example of the construction of another preferred embodiment of a
translating apparatus according to the invention; and
Fig. 16 is a flow chart illustrating processing carried out in the converting part 7 of Fig. 15.



Detailed Description of the Preferred Embodiments

Fig. 5 shows the construction of a preferred embodiment of a translating apparatus according to the
invention. Parts in Fig. 5 corresponding to parts shown in Fig. 1 have been given the same reference
numerals and will not be described in the following. This translating apparatus, as will be further
discussed later, divides a first language sentence into syntax units consisting of predetermined units
of sentence structure, in stages, first into large syntax units and then into small syntax units, and
translates the syntax units obtained at each stage using examples, and in this sense it can be called a
multi-stage example-using translating apparatus. Here, as the first language sentence for example a
Japanese sentence is inputted, and as a second language sentence constituting the result of translation
of the first language sentence for example an English sentence is outputted.

An inputting part 1 of this translating apparatus, like the inputting part mentioned above, consists of
for example a keyboard or a voice recognition device or a character recognition device, and outputs in
the form of text data or the like a first language sentence inputted into it. Here, the inputting part
1 detects the beginning and end of each word (or morpheme) constituting the first language sentence,
and its part of speech, root and Chinese character reading and so on (hereinafter, this information
obtained by morpheme analysis will for convenience be called morpheme information), and outputs these
in the form of text data together with the first language sentence.

The output of the inputting part 1 is supplied to a collating part 2. The collating part 2 collates
examples stored in a set phrase translation example memory 3 with the first language sentence from the
inputting part 1 and determines whether or not there is among the examples stored in the set phrase
translation example memory 3 one which matches the first language sentence. When an example which
matches the first language sentence from the inputting part 1 is stored in the set phrase translation
example memory 3, the collating part 2 reads from the set phrase translation example memory 3 a
translation corresponding to this example and supplies it to an outputting part 16. Thus, in this case,
processing is not carried out in subsequent stages of a clause dividing part 4 through to a clause
compounding part 15. The translation from the collating part 2 is outputted by the outputting part 16
as the second language sentence.

When no example matching the first language sentence from the inputting part 1 is stored in the set
phrase translation example memory 3, the collating part 2 outputs the first language sentence from the
inputting part 1 and its morpheme information to a clause dividing part 4.

The set phrase translation example memory 3 stores translation example data made up of examples of for
instance greetings and other so-called set phrases expressed in the first language (here, as mentioned
above, Japanese) and corresponding translations consisting of these examples translated into the second
language (here, as mentioned above, English). Therefore, when the first language sentence is a set
phrase (a set phrase stored as an example in the set phrase translation example memory 3), the
translation corresponding to the example matching that set phrase is supplied from the collating part 2
to the outputting part 16, and when the first language sentence is not a set phrase this first language
sentence and its morpheme information are supplied from the collating part 2 to the clause dividing
part 4.

When it receives the first language sentence and the morpheme information from the collating part 2,
the clause dividing part 4 determines whether or not there is in the first language sentence a clause
matching any of the examples stored in a clause division pattern translation example memory 5, and when
there is such a clause in the first language sentence cuts the first language sentence at this clause
and thereby divides the first language sentence into clauses, or large syntax units. The clause
dividing part 4 supplies parts of the result (clauses) of the division of the first language sentence
other than the clause which matches one of the examples stored in the clause division pattern
translation example memory 5 to a simple syntax analyzing processing part 6 together with the morpheme
information. Also, the clause dividing part 4 reads from the clause division pattern translation
example memory 5 the translation corresponding to the example matching the clause used to divide up the
first language sentence and supplies this together with information relating to how the first language
sentence was divided up (hereinafter referred to for convenience as clause division information) to the
clause compounding part 15.

When on the other hand there is in the first language sentence no clause matching an example stored in the clause
division pattern translation example memory 5, that is, when the first language sentence consists of a single clause, the
clause dividing part 4 supplies the first language sentence unchanged to the simple syntax analyzing processing part
6 together with its morpheme information.

The clause division pattern translation example memory 5 stores translation example data made up of examples in
clause units expressed in the first language and corresponding translations consisting of these examples translated into
the second language. Here, examples of so-called patterned clauses such as for example 'DEKITARA' ("if possible") are
stored in the clause division pattern translation example memory 5.

The simple syntax analyzing processing part 6, by carrying out so-called shallow syntax analysis on a clause constituting
the first language sentence from the clause dividing part 4 on the basis of the morpheme information also from
the clause dividing part 4, for example using language-governing rules (knowledge) described as regular grammar,
divides the clause (when a plurality of clauses are supplied from the clause dividing part 4, each of the clauses) into
phrase units. That is, here, the simple syntax analyzing processing part 6 divides up the first language sentence clause
from the clause dividing part 4 into for example noun phrases, verb phrases and other phrases (for example adverb
phrases and sentence modifiers and so on), and supplies these to a converting part 7.

The simple syntax analyzing processing part 6, by carrying out syntax analysis, also carries out extraction of grammatical
attributes for each phrase, such as for example the syntactic head (the main word of the phrase, for example the
important part of a noun phrase (a modified word or the like) or, when a noun phrase consists of a single noun, that
noun) and modality (for example information relating to potential, conditional, desiderative and so on). Also, the simple
syntax analyzing processing part 6 detects linkage relationships between phrases constituting the clause, and other
syntax information (hereinafter called clause syntax information). The grammatical attributes and clause syntax information
are supplied to the converting part 7 together with the phrases obtained as described above.

The converting part 7 detects phrase unit examples most similar to the phrases of the first language sentence from
the simple syntax analyzing processing part 6 from a noun phrase translation example memory 8, a verb phrase translation
example memory 9 and an other translation example memory 10, and carries out translation in phrase units on
the basis of translations corresponding to these examples.

That is, the converting part 7 controls a similarity degree calculating part 11 to calculate word similarity degrees
expressing the similarity in meaning between the words in the examples stored in the noun phrase translation example
memory 8, the verb phrase translation example memory 9 or the other translation example memory 10 and the words
constituting the phrases from the simple syntax analyzing processing part 6, and on the basis of these word similarity
degrees and other parameters (for example the grammatical attributes outputted by the simple syntax analyzing
processing part 6) detects the examples most similar to the phrases from the simple syntax analyzing processing part
6. In the converting part 7, when an example matching a phrase from the simple syntax analyzing processing part 6 is
stored in the noun phrase translation example memory 8, the verb phrase translation example memory 9 or the other
translation example memory 10, this example is detected as the example most similar to the phrase from the simple
syntax analyzing processing part 6.

Also, by referring to a translation dictionary 13, the converting part 7 replaces those of the words constituting the
translation corresponding to the most similar example thus detected which do not correspond to words constituting the
first language sentence with translations of those words, and outputs translations of phrases obtained in this way to a
phrase compounding part 14 together with the clause syntax information from the simple syntax analyzing processing
part 6.

The converting part 7 carries out the processing described above for each noun phrase, verb phrase and other
phrase supplied from the simple syntax analyzing processing part 6. Detection of the phrase unit example most similar
to a noun phrase, a verb phrase or an other phrase is carried out with reference respectively only to the noun phrase
translation example memory 8, the verb phrase translation example memory 9 or the other translation example memory
10.

The noun phrase translation example memory 8, the verb phrase translation example memory 9 and the other
translation example memory 10 respectively store examples in noun phrase units, verb phrase units and other phrase
units.

The similarity degree calculating part 11, under the control of the converting part 7, on the basis of a thesaurus
stored in a thesaurus memory 12, calculates degrees of similarity between the words constituting the examples stored
in the noun phrase translation example memory 8, the verb phrase translation example memory 9 or the other translation
example memory 10 and the words constituting the phrases outputted by the simple syntax analyzing processing
part 6. The thesaurus memory 12 stores a thesaurus wherein words are classified in a tree structure on the basis of
their concepts. The translation dictionary 13 stores words of the first language in association with corresponding words
of the second language (words of the first language translated into the second language (translations)).

The phrase compounding part 14, by compounding the translations of noun phrases, verb phrases and other
phrases supplied from the converting part 7 on the basis of the clause syntax information also supplied from the converting
part 7 and language-governing rules of the second language, produces clauses of the second language and
supplies them to the clause compounding part 15. The clause compounding part 15 compounds the clauses and
thereby produces a second language sentence constituting the result of translation of the first language sentence.

That is, when the first language sentence is made up of a plurality of clauses, results of translation of a plurality of
clauses are supplied to the clause compounding part 15 from the phrase compounding part 14. Also, a result of translation
of a clause may be supplied to the clause compounding part 15 not only from the phrase compounding part 14
but also from the clause dividing part 4. By compounding (combining) the results of translation of a plurality of clauses
supplied to it in this way the clause compounding part 15 produces a second language sentence. The second language
sentence produced by the clause compounding part 15 is supplied to the outputting part 16 [...]

From the clause dividing part 4, as mentioned above, besides a result of translation (a translation) of a clause,
clause division information is also supplied to the clause compounding part 15, and the translations of clauses outputted
by the phrase compounding part 14 are compounded with the translation of the clause from the clause dividing
part 4 on the basis of this clause division information.

Next, with reference to Fig. 6, the thesaurus stored in the thesaurus memory 12 will be described. In the thesaurus
memory 12 is stored a thesaurus wherein words are classified in the same way as in for example the EDR (Japan Electronic
Dictionary Research Institute) concept dictionary, and Fig. 6 shows a part of this thesaurus (Source: 'Japan Electronic
Dictionary Research Institute, EDR Electronic Dictionary Product Edition (1.0th Edition), 1995'). That is, Fig. 6
shows those words (enclosed by ellipses in the figure) in the thesaurus stored in the thesaurus memory 12 which can
be expressed as 'go' (in the figure not the words themselves but rather their senses are shown), and their higher concepts
(enclosed by rectangles in the figure).

This thesaurus is not so constructed that when the classes above are seen from the words there are the same
number of classes above each word. That is, when for example as in the case shown in Fig. 4 the class to which the
highest concept belongs is made the 0th class and the classes below it are successively called the 1st class, the 2nd
class, and so on, the word W1 having the meaning (sense) 'time passes' belongs to the 3rd-class concept of 'the passing
of time' and the word having the meaning 'do gradually' belong[...]
has been until now'. Also, in this thesaurus, for example the word W3 having the meaning 'animal/person dies' and the
word W4 having the meaning 'arrive at destination' are included in a plurality of concepts [...]
word W3 is included in both of the 4th-class concepts of 'physiological phenomena of animals' and 'life going out', and
the word W4 is included in both of the 7th-class concepts of 'subject move[...]
Thus, this thesaurus is not constructed regularly.
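
The two irregularities just described (a different number of classes above each word, and words included in more than
one concept) can be illustrated with a small sketch. The concept names, the word labels W_a and W_b and the PARENTS
table are hypothetical and far smaller than the thesaurus of the thesaurus memory 12; the sketch only shows the shape of
the data, not the similarity calculation.

```python
# Hypothetical fragment of an irregular thesaurus: each entry maps a word or
# concept to the concepts directly above it. The root has no parents.
PARENTS = {
    "root": [],
    "event": ["root"],
    "movement": ["event"],
    "change": ["event"],
    "arrival": ["movement"],
    "W_a": ["change"],             # a word under a single concept
    "W_b": ["arrival", "change"],  # a word included in two concepts
}


def class_depths(node, parents=PARENTS):
    """Return the set of class numbers at which a node can sit.

    The root is class 0; a node reachable along several paths of different
    length sits at several class numbers at once.
    """
    if not parents[node]:
        return {0}
    return {d + 1 for p in parents[node] for d in class_depths(p, parents)}
```

Here W_a sits in the 3rd class only, while W_b sits in both the 3rd and the 4th class, so a measure that assumes one fixed
depth per word would not apply directly.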

Also, this thesaurus is not so constructed that the distances between concepts joined by lines are the same. That
is, for example the concept C1 ('subject moves to end point') is connected to [...]
and ('arrival at end point'), but the distance between the concepts C1 and [...]
as the distance between the concepts C1 and C3.

In the similarity degree calculating part 11 of Fig. 4, as will be further discussed later, word similarity degrees accurately
reflecting similarities between words are calculated notwithstanding that the thesaurus is constructed in the way
described above.

Next, with reference to the flow chart of Fig. 7, the operation of the translating apparatus shown in Fig. 5 will be
described. When a first language sentence is inputted into the inputting part 1, in a step S0, the inputting part 1 converts
this first language sentence into text data and generates morpheme information of the first language sentence and supplies
these to the collating part 2. In a step S1, the collating part 2 collates the examples stored in the set phrase translation
example memory 3 with the first language sentence from the inputting part 1 and thereby determines whether or
not there is an example stored in the set phrase translation example memory 3 which matches the first language sentence
from the inputting part 1.

When in the step S1 it is determined that there is an example matching the first language sentence from the inputting
part 1 stored in the set phrase translation example memory 3, processing proceeds to a step S2 wherein the first
language sentence is replaced with the translation corresponding to this matching example and then proceeds to a step
S13. In the step S13, the translation obtained in the step S1 is supplied from the collating part 2 to the outputting part
16, which then outputs a second language sentence consisting of the first language sentence translated into the second
language, and processing ends.

When on the other hand it is determined in the step S1 that there is no example stored in the set phrase translation
example memory 3 which matches the first language sentence from the inputting part 1, this first language sentence is
supplied together with its morpheme information from the collating part 2 to the clause dividing part 4. Then, in the
clause dividing part 4, in a step S3, it is determined whether or not there is among the clauses constituting the first language
sentence a clause which matches one of the clause unit examples stored in the clause pattern translation example
memory 5.

When in the step S3 it is determined that among the clauses constituting the first language sentence there is one
which matches a clause unit example stored in the clause pattern translation example memory 5, processing proceeds
to a step S4 and the clause dividing part 4 cuts the first language sentence at that clause, thereby dividing the first language
sentence into clause units, and generates clause division information. The clause dividing part 4 then reads from
the clause pattern translation example memory 5 the translation of the clause among the clauses constituting the first
language sentence which matches a clause unit example stored in the clause pattern translation example memory 5,
and outputs this translation to the clause compounding part 15 together with the clause division information. Also, the
clause dividing part 4 supplies the remaining clauses constituting the first language sentence to the simple syntax analyzing
part 6 together with the morpheme information, and processing then proceeds to a step S5.

When on the other hand it is determined in the step S3 that there is among the clauses constituting the first language
sentence no clause which matches a clause unit example stored in the clause pattern translation example memory
5, that is, when for example the first language sentence is made up of a single clause, the clause dividing part 4
outputs the first language sentence to the simple syntax analyzing part 6 together with the morpheme information and
processing skips the step S4 and proceeds to the step S5.

In the step S5, in the simple syntax analyzing part 6, syntax analysis is carried out on clauses constituting the first
language sentence from the clause dividing part 4 on the basis of morpheme information also from the clause dividing
part 4 and using language-governing rules (knowledge) of the first language, and as a result of this analysis the clauses
of the first language sentence are divided into noun phrases, verb phrases and other phrases, and grammatical
attributes of each phrase and clause syntax information such as linkage relationships between phrases constituting the
clauses are generated. The phrases, the grammatical attributes and the clause syntax information are then supplied
from the simple syntax analyzing part 6 to the converting part 7 and processing proceeds to a step S6.

In the step S6, in the converting part 7, each of the phrases supplied from the simple syntax analyzing part 6 and
the stored content of the noun phrase translation example memory 8, the verb phrase translation example memory 9
or the other translation example memory 10 are collated and it is thereby determined whether or not a phrase unit
example matching a phrase supplied from the simple syntax analyzing part 6 is stored in the noun phrase translation
example memory 8, the verb phrase translation example memory 9 or the other translation example memory 10. When
in the step S6 it is determined that an example matching a phrase supplied from the simple syntax analyzing part 6 is
stored in the noun phrase translation example memory 8, the verb phrase translation example memory 9 or the other
translation example memory 10, processing proceeds to a step S7 and the converting part 7 reads a translation corresponding
to this matching example from the noun phrase translation example memory 8, the verb phrase translation
example memory 9 or the other translation example memory 10 and outputs it to the phrase compounding part 14, and
processing then proceeds to a step S10.

Here, noun phrases, verb phrases and other phrases obtained, for example, from the simple syntax analyzing part
6 by sentence unit examples being inputted into the inputting part 1 are respectively stored in the noun phrase translation
example memory 8, the verb phrase translation example memory 9 and the other translation example memory 10.
Also, the noun phrases, verb phrases and other phrases obtained in this way are stored in association with corresponding
grammatical attributes.

That is, phrase unit examples are stored in the noun phrase translation example memory 8, the verb phrase translation
example memory 9 and the other translation example memory 10 together with, as it were, abstract attributes
thereof obtained as a result of syntax analysis, called grammatical attributes. Therefore, the converting part 7 can be
given generalized linguistic knowledge and made to perform collation of phrases from the simple syntax analyzing part
6 (hereinafter for convenience called input phrases) with the phrases stored in the noun phrase translation example
memory 8, the verb phrase translation example memory 9 or the other translation example memory 10 (hereinafter for
convenience called example phrases) taking into consideration grammatical attributes of the phrases (grammatical
attributes of the input phrases and of the example phrases), and efficient processing becomes possible as a result.

Specifically, when for example the verb phrase 'YOYAKU O SHITE ITADAKITAI NO DESU GA' ("I'd like you to make
a reservation") has been inputted into the simple syntax analyzing part 6, as grammatical attributes (or 'slot-name values')
for example (gerund-suffix ITADAKU) (desiderative +) (extended-predicate +) (s-part GA) and (formal +) are
obtained. Here, the most concise expression (as it were, the expression equivalent to a root) of the verb phrase
'YOYAKU O SHITE ITADAKITAI NO DESU GA' is 'YOYAKU O SURU' ("to make a reservation"), and the slot-name value
(gerund-suffix ITADAKU) means that 'ITADAKU' has been added like a suffix to this 'YOYAKU O SURU', the slot-name
value (desiderative +) means that an expression expressing desire (here, 'TAI') has been added, the slot-name value
(extended-predicate +) means that an expression for adding a predicate (here, 'NO DESU') has been added, the slot-name
value (s-part GA) means that an ending particle (here, 'GA') has been added, and the slot-name value (formal +)
means that the sentence ending is polite (here, that 'NO DESU' has been used as the sentence ending and not for
example the less polite 'NO DA').

In this case, as generalized linguistic knowledge, for example the knowledge that the 'NO' of the expression corresponding
to the slot-name value (extended-predicate +) sometimes becomes 'N' is given to the simple syntax analyzing
part 6, and when for example 'YOYAKU O SHITE ITADAKITAI 'N DESU GA' (also meaning "I'd like you to make a reservation"),
wherein the 'NO' in the verb phrase mentioned above has become 'N', is inputted as an input phrase, again the
same slot-name values as in the case above of (gerund-suffix ITADAKU) (desiderative +) (extended-predicate +) (s-part
GA) and (formal +) are outputted.

Therefore, if the verb phrase 'YOYAKU O SHITE ITADAKITAI NO DESU GA' is stored together with its slot-name
values as an example phrase in the verb phrase translation example memory 9, in the converting part 7 the example
phrase 'YOYAKU O SHITE ITADAKITAI NO DESU GA' is determined to match the input phrase 'YOYAKU O SHITE
ITADAKITAI 'N DESU GA'.

Also, when 'YOYAKU O SHITE ITADAKITAI GA' (still meaning "I'd like you to make a reservation"), consisting of the
first-mentioned verb phrase 'YOYAKU O SHITE ITADAKITAI NO DESU GA' with the expressions corresponding to the
slot-name values (extended-predicate +) and (formal +) omitted, is inputted as an input phrase, the slot-name values
(gerund-suffix ITADAKU) (desiderative +) (s-part GA) are outputted by the simple syntax analyzing part 6. Here,
let it be supposed that for example the knowledge that expressions corresponding to the slot-name values
(extended-predicate +) and (formal +) can be ignored has been given to the converting part 7 as generalized linguistic
knowledge. Then, assuming that the first-mentioned verb phrase 'YOYAKU O SHITE ITADAKITAI NO DESU GA' is
stored together with its slot-name values as an example phrase in the verb phrase translation example memory 9, in
the converting part 7 collation is carried out ignoring the (extended-predicate +) and (formal +) among the slot-name
values of the example phrase 'YOYAKU O SHITE ITADAKITAI NO DESU GA', and as a result the example phrase
'YOYAKU O SHITE ITADAKITAI NO DESU GA' is determined to match the input phrase 'YOYAKU O SHITE ITADAKITAI
GA'.

In this way, even if an input phrase does not match an example phrase completely it is possible to obtain an example
phrase expressing the same meaning as the input phrase. That is, by taking grammatical attributes into consideration,
it is possible to carry out processing in the converting part 7 ignoring the 'NO DESU' in the example phrase.
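
The collation that ignores (extended-predicate +) and (formal +) can be sketched as a comparison of slot-name values.
The sketch assumes that each phrase is reduced to its root expression plus a set of (slot-name, value) pairs; that
representation, and the names IGNORABLE and phrases_match, are assumptions made for illustration only.

```python
# Slot-name values that the converting part 7 is told it may ignore.
IGNORABLE = {("extended-predicate", "+"), ("formal", "+")}

# Example phrase 'YOYAKU O SHITE ITADAKITAI NO DESU GA' as (root, slot-name values).
EXAMPLE = (
    "YOYAKU O SURU",
    {
        ("gerund-suffix", "ITADAKU"),
        ("desiderative", "+"),
        ("extended-predicate", "+"),
        ("s-part", "GA"),
        ("formal", "+"),
    },
)

# Input phrase 'YOYAKU O SHITE ITADAKITAI GA' as (root, slot-name values).
INPUT = (
    "YOYAKU O SURU",
    {("gerund-suffix", "ITADAKU"), ("desiderative", "+"), ("s-part", "GA")},
)


def phrases_match(a, b, ignorable=IGNORABLE):
    """Compare two (root, slots) phrases, disregarding the ignorable slot-name values."""
    return a[0] == b[0] and (a[1] - ignorable) == (b[1] - ignorable)
```

With this comparison the input phrase matches the stored example even though the example carries two extra
slot-name values, so no separate example is needed for the shorter form.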

Also, as a result of this, it becomes unnecessary to provide example phrases corresponding to superficial variations;
that is, it becomes unnecessary to store in the verb phrase translation example memory 9 as example phrases, in
addition to the verb phrase 'YOYAKU O SHITE ITADAKITAI NO DESU GA', the verb phrase 'YOYAKU O SHITE ITADAKITAI
GA' with the 'NO DESU' removed, the verb phrase 'YOYAKU O SHITE ITADAKITAI DESU GA' with the 'NO'
removed or the verb phrase 'YOYAKU O SHITE ITADAKITAI 'N DESU GA' wherein the 'NO' has been changed to 'N',
and therefore example phrases can be stored in the noun phrase translation example memory 8, the verb phrase translation
example memory 9 and the other translation example memory 10 efficiently. Also, the noun phrase translation
example memory 8, the verb phrase translation example memory 9 and the other translation example memory 10 can
be called a data base of example phrases, and when the approach described above is adopted it becomes possible to
carry out the production, maintenance and improvement of this data base easily.

When on the other hand it is determined in the step S6 that no example phrase in the noun phrase translation
example memory 8, the verb phrase translation example memory 9 or the other translation example memory 10
matches an input phrase (a phrase supplied from the simple syntax analyzing part 6), processing proceeds to a step
S8 and the converting part 7, by controlling the similarity degree calculating part 11, causes word similarity degrees to
be calculated and on the basis of these word similarity degrees and other parameters computes translation suitabilities,
which will be further discussed later, expressing the similarity (in meaning) between the input phrase and each of the
example phrases stored in the noun phrase translation example memory 8, the verb phrase translation example memory
9 or the other translation example memory 10. The converting part 7 then detects the example phrase of which the
translation suitability is the highest and processing proceeds to a step S9.

In the step S9, the converting part 7 produces a translation (hereinafter for convenience called an input phrase
translation) of the input phrase on the basis of the translation corresponding to the example phrase detected in the step
S8 and with reference to the translation dictionary 13, and outputs it to the phrase compounding part 14 together with
the clause syntax information from the simple syntax analyzing part 6.

Here, the processing of the aforementioned steps S6 through S9 is carried out with respect to the example phrases
stored in either the noun phrase translation example memory 8, the verb phrase translation example memory 9 or the
other translation example memory 10, depending on whether the input phrase is a noun phrase, a verb phrase or an
other phrase respectively. Therefore, because the example phrases that are made objects of processing are limited
according to the type of the input phrase, it is possible to increase the translation processing speed.
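
Steps S6 through S8, restricted to the memory for the input phrase's type, can be sketched as follows. The translation
suitability itself is described later in the specification, so the shared_words function here is only a toy stand-in and not
the measure the apparatus uses; the stored example phrases and all names are likewise hypothetical.

```python
def shared_words(a, b):
    """Toy stand-in for the translation suitability (NOT the specification's measure)."""
    return len(set(a.split()) & set(b.split()))


def translate_phrase(phrase, kind, memories, suitability=shared_words):
    """Look up a phrase only in the memory for its type ('noun', 'verb' or 'other')."""
    examples = memories[kind]  # the other two memories are never searched
    if not examples:
        return None
    for source, translation in examples:
        if source == phrase:  # steps S6 and S7: an exact match is used directly
            return translation
    # Step S8: otherwise pick the example with the highest suitability.
    best_source, best_translation = max(
        examples, key=lambda ex: suitability(phrase, ex[0])
    )
    # Step S9 would adapt this translation using the translation dictionary 13.
    return best_translation


# Hypothetical contents of the three example memories.
MEMORIES = {
    "noun": [("AKAI KURUMA", "red car"), ("ICHIGO NO KEEKI", "strawberry cake")],
    "verb": [("YOYAKU O SURU", "make a reservation")],
    "other": [],
}
```

Because only one of the three lists is scanned per input phrase, the number of candidate examples is reduced before any
suitability is computed.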

When the phrase compounding part 14 has received translations (input phrase translations) of all the input phrases
constituting a certain clause, in a step S10 it generates a clause of the second language by compounding these input
phrase translations on the basis of clause syntax information also supplied from the converting part 7 and supplies it to
the clause compounding part 15.

After that, processing proceeds to a step S11 and the clause compounding part 15 determines whether or not the
first language sentence is made up of a plurality of clauses. When in the step S11 it is determined that the first language
sentence is made up of a plurality of clauses, that is, when a plurality of clauses (second language clauses) have been
supplied to the clause compounding part 15 from the phrase compounding part 14, or when clauses have been supplied
to the clause compounding part 15 from both the clause dividing part 4 and the phrase compounding part 14,
processing proceeds to a step S12 and the clause compounding part 15 generates a second language sentence by
compounding these clauses and supplies it to the outputting part 16, and then processing proceeds to a step S13.

When in the step S11 it is determined that the first language sentence is not made up of a plurality of clauses, that
is, when only one clause has been supplied to the clause compounding part 15 from the phrase compounding part 14,
the clause compounding part 15 supplies this clause (a second language sentence) unchanged to the outputting part
16 and processing skips the step S12 and proceeds to the step S13. In the step S13, the second language sentence
supplied from the clause compounding part 15 is outputted, and processing ends.
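
The branch at steps S11 and S12 can be sketched as below. The sketch assumes that a patterned-clause translation is a
template containing the placeholder X and that clauses from the phrase compounding part 14 are simply joined by spaces;
both are simplifying assumptions, since the specification leaves the details to the clause division information.

```python
def compound_clauses(phrase_clauses, pattern_translation=None):
    """Combine second language clauses into one sentence (steps S11 and S12).

    phrase_clauses: clauses from the phrase compounding part 14.
    pattern_translation: optional template from the clause dividing part 4,
    such as 'X, if possible', whose X is filled with the remaining clauses.
    """
    body = " ".join(phrase_clauses)
    if pattern_translation is None:
        # A single source of clauses: pass the result through unchanged.
        return body
    return pattern_translation.replace("X", body, 1)
```

For example, compound_clauses(["may I have some strawberry ice cream"], "X, if possible") fills the template, while a
lone clause with no template is returned as it is.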

Because, as described above, in the clause dividing part 4 and the simple syntax analyzing part 6 the first language
sentence is divided in stages first into clauses, which are large syntax units, and then into phrases, which are small syntax
units, and translation of the syntax units obtained at each stage, i.e. clauses and phrases, is carried out using examples,
compared to a case wherein translation of the entire first language sentence is carried out in one stage, it is
possible to increase the translation processing speed.

In the case described above, division of the first language sentence into clauses is carried out when there is among
the clauses constituting the first language sentence a clause which matches a clause unit example stored in the clause
pattern translation example memory 5 and on the basis of this clause; however, division into clauses can also be carried
out irrespective of this kind of condition.

Next, the processing carried out by the translating apparatus of Fig. 5 when for example 'DEKIMASHITARA
ICHIGO NO AISU KUREEMU O ONEGAI SHITAI 'N DESU GA' ("If possible, may I have some strawberry ice cream?")
is inputted will be described with reference to Figs. 8A through 10C.

By morpheme analysis being carried out in the inputting part 1, for example the kind of morpheme information
shown in Fig. 8A is extracted from this first language sentence 'DEKIMASHITARA ICHIGO NO AISU KUREEMU O
ONEGAI SHITAI 'N DESU GA'. In Fig. 8A, for reference, modality case information in the grammatical attributes of the
first language sentence is also shown.

The first language sentence 'DEKIMASHITARA ICHIGO NO AISU KUREEMU O ONEGAI SHITAI 'N DESU GA'
and its morpheme information are supplied from the inputting part 1 to the collating part 2. Now, assuming that
'DEKIMASHITARA ICHIGO NO AISU KUREEMU O ONEGAI SHITAI 'N DESU GA' is not stored as an example in the set
phrase translation example memory 3, the first language sentence 'DEKIMASHITARA ICHIGO NO AISU KUREEMU O
ONEGAI SHITAI 'N DESU GA' and its morpheme information are supplied from the collating part 2 to the clause dividing
part 4.

In the clause dividing part 4, collation of the first language sentence 'DEKIMASHITARA ICHIGO NO AISU
KUREEMU O ONEGAI SHITAI 'N DESU GA' with the clause unit examples stored in the clause pattern translation
example memory 5 is carried out. Here, supposing now that for example translation example data of the kind shown in
Fig. 8B made up of an example 'DEKIMASHITARA X' and its translation 'X, if possible' are stored in correspondence
with each other in the clause pattern translation example memory 5, because in the first language sentence
'DEKIMASHITARA ICHIGO NO AISU KUREEMU O ONEGAI SHITAI 'N DESU GA' there is a clause matching the example
'DEKIMASHITARA' (in this case, 'ICHIGO NO AISU KUREEMU O ONEGAI SHITAI 'N DESU GA' is recognized as matching
the X part), the first language sentence 'DEKIMASHITARA ICHIGO NO AISU KUREEMU O ONEGAI SHITAI 'N DESU
GA' is divided up in the clause dividing part 4 into the clauses 'DEKIMASHITARA' and 'ICHIGO NO AISU KUREEMU
O ONEGAI SHITAI 'N DESU GA'.

The translation 'X, if possible' corresponding to the clause 'DEKIMASHITARA' matching a clause unit example
stored in the clause pattern translation example memory 5 is then supplied from the clause dividing part 4 to the clause
compounding part 15. Also, the remaining clause of the first language sentence, 'ICHIGO NO AISU KUREEMU O
ONEGAI SHITAI 'N DESU GA', is supplied from the clause dividing part 4 to the simple syntax analyzing part 6.
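
The division just walked through can be sketched as follows. The sketch assumes that a patterned clause always stands
at the start of the sentence and is separated from the rest by a space, which holds for this example but is not stated as a
general rule in the specification; the names CLAUSE_PATTERNS and divide_clauses are hypothetical.

```python
# Hypothetical stand-in for the clause pattern translation example memory 5:
# patterned clause -> translation template with placeholder X.
CLAUSE_PATTERNS = {"DEKIMASHITARA": "X, if possible"}


def divide_clauses(sentence, patterns=CLAUSE_PATTERNS):
    """Split off a leading patterned clause, if one is stored.

    Returns (template, remaining_clause). The template is None when no
    stored pattern matches, in which case the sentence is returned whole.
    """
    for clause, template in patterns.items():
        prefix = clause + " "
        if sentence.startswith(prefix):
            return template, sentence[len(prefix):]
    return None, sentence
```

The template would go to the clause compounding part 15 and the remaining clause to the simple syntax analyzing part 6.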

In the simple syntax analyzing part 6, by the syntax of the clause 'ICHIGO NO AISU KUREEMU O ONEGAI SHITAI
'N DESU GA' being analyzed, it is recognized that 'ICHIGO NO AISU KUREEMU O' constitutes one noun phrase and
'ONEGAI SHITAI 'N DESU GA' constitutes one verb phrase. Also, in this case, it is recognized that the noun phrase
'AISU KUREEMU O' is the object of the verb phrase 'ONEGAI SHITAI 'N DESU GA'. On the basis of these recognition
results, the clause 'ICHIGO NO AISU KUREEMU O ONEGAI SHITAI 'N DESU GA' is divided into phrase units.

Here, for example although the translation of the verb 'TORU' in the sentence 'OKANE O TORU' ("to take money")
is 'take' or 'get', the translation of the verb 'TORU' in the sentence 'CHOUSHOKU O TORU' ("to have breakfast") is 'have'
or 'eat'. Thus with a verb (or a verb phrase) it is sometimes difficult to determine its translation if the object, subject or
complement is not known. For this reason, in the simple syntax analyzing part 6, in cases like this, when necessary an
object, subject or complement or the like is included with the verb (predicative part) to make one verb phrase.
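
Why the object is kept with the verb can be seen in a minimal sketch in which examples are keyed by the pair of object
and verb rather than by the verb alone. The table and the function name are hypothetical; only the two 'TORU' examples
come from the text above.

```python
# Hypothetical verb phrase examples keyed by (object, verb).
VERB_PHRASE_EXAMPLES = {
    ("OKANE", "TORU"): "take",
    ("CHOUSHOKU", "TORU"): "have",
}


def translate_verb(obj, verb, examples=VERB_PHRASE_EXAMPLES):
    """Return the verb translation stored for this object, or None if absent."""
    return examples.get((obj, verb))
```

A table keyed by 'TORU' alone could hold only one of the two translations, which is the difficulty the paragraph describes.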

That is, in this case, the clause 'ICHIGO NO AISU KUREEMU O ONEGAI SHITAI 'N DESU GA' is divided into the
noun phrase 'ICHIGO NO AISU KUREEMU O' and the verb phrase, including an object, of 'AISU KUREEMU O
ONEGAI SHITAI 'N DESU GA'.

Also, in the simple syntax analyzing part 6, grammatical attributes of the phrases are generated (recognized) and
assigned to the phrases.

That is, for the noun phrase 'ICHIGO NO AISU KUREEMU O', for example slot-name values to the effect that
'ICHIGO NO' is a modifier of 'AISU KUREEMU O' and 'AISU KUREEMU O' is modified by 'ICHIGO NO' are produced
(however, because in this case these grammatical attributes are not used in subsequent processing they will not be
referred to particularly in the following).

Also, for the verb phrase 'AISU KUREEMU O ONEGAI SHITAI 'N DESU GA', as shown in Fig. 9A, a slot-name
value (root 'ONEGAI SURU' + desiderative) meaning that 'ONEGAI SHITAI 'N DESU GA' is a desiderative form of the
root 'ONEGAI SURU' and a slot-name value (NP1 'AISU KUREEMU O') meaning that its object is ice cream (a food)
are produced.



The noun phrase 'ICHIGO NO AISU KUREEMU O' and the verb phrase 'AISU KUREEMU O ONEGAI SHITAI 'N
DESU GA' and their grammatical attributes thus obtained are supplied from the simple syntax analyzing part 6 to the
converting part 7.

In the converting part 7, each of the phrases supplied from the simple syntax analyzing part 6 and the stored
content of the noun phrase translation example memory 8, the verb phrase translation example memory 9 or the other
translation example memory 10 are collated and the example phrases most similar to the phrases supplied from the
simple syntax analyzing part 6 are detected (searched) from the noun phrase translation example memory 8, the verb
phrase translation example memory 9 or the other translation example memory 10.

That is, when for instance in the verb phrase translation example memory 9 the example 'TANAKA SAN O ONEGAI
SHITAI 'N DESU GA' ('Mr. Tanaka, please') is stored in association with the slot-name values (root 'ONEGAI SURU' +
desiderative) and NP1 'TANAKA SAN O' and also in correspondence with the translation of this, 'may I speak to NP1?',
as shown in Fig. 9B, and the example 'KOUHEE O IPPAI ONEGAI SHITAI 'N DESU GA' is stored in association with
the slot-name values (root 'ONEGAI SURU' + desiderative) and NP1 'KOUHEE O IPPAI' and also in correspondence with the
translation of this, 'may I have NP1?', as shown in Fig. 9C, then from a comparison of the distance in meaning between
the objects 'AISU KUREEMU O' and 'TANAKA SAN O' and the distance in meaning between the objects 'AISU
KUREEMU O' and 'KOUHEE O IPPAI', the example 'KOUHEE O IPPAI ONEGAI SHITAI 'N DESU GA' is detected as
the example most similar to the verb phrase 'AISU KUREEMU O ONEGAI SHITAI 'N DESU GA'.

The translation 'may I have NP1?' corresponding to this example 'KOUHEE O IPPAI ONEGAI SHITAI 'N DESU GA'
is then read from the verb phrase translation example memory 9 and supplied from the converting part 7 to the phrase
compounding part 14 as the translation of the verb phrase 'AISU KUREEMU O ONEGAI SHITAI 'N DESU GA'.

Also, in the converting part 7, supposing that for instance the example 'SAKUJITSU NO GYUUNYUU' and its
translation 'yesterday's milk', the example 'WATASHI NO HON' and its translation 'my book' and the example 'KOUHEE NO
AISU KUREEMU' and its translation 'coffee flavored ice cream' are respectively stored in correspondence with each
other in the noun phrase translation example memory 8, as shown in Fig. 10A, 'KOUHEE NO AISU KUREEMU' is
detected as the example most similar to the noun phrase 'ICHIGO NO AISU KUREEMU O' from the simple syntax
analyzing part 6, and the translation 'coffee flavored ice cream' corresponding to this is read from the noun phrase
translation example memory 8.

In this case, in the converting part 7 it is recognized that in the noun phrase 'ICHIGO NO AISU KUREEMU O' the
'KOUHEE' in the example 'KOUHEE NO AISU KUREEMU' has become 'ICHIGO', and 'ICHIGO', which is not in the
example, is looked up by referring to the translation dictionary 13.

Here, it will be assumed that 'ICHIGO' and its translation 'strawberry' are stored in correspondence with each other
in the translation dictionary 13 as shown in Fig. 10B.

When the converting part 7 finds 'ICHIGO' in the translation dictionary 13, it reads out the 'strawberry' stored in
correspondence with it. Then, the translation 'coffee' of the 'KOUHEE', corresponding to 'ICHIGO', in the translation 'coffee
flavored ice cream' read from the noun phrase translation example memory 8, is replaced with the 'strawberry' read
from the translation dictionary 13, and the translation 'strawberry flavored ice cream' of the noun phrase 'ICHIGO NO
AISU KUREEMU O' is thereby generated. This translation 'strawberry flavored ice cream' is supplied from the
converting part 7 to the phrase compounding part 14.

In the phrase compounding part 14, the translation 'strawberry flavored ice cream' of the noun phrase 'ICHIGO NO
AISU KUREEMU O' and the translation 'may I have NP1?' of the verb phrase 'AISU KUREEMU O ONEGAI SHITAI 'N
DESU GA' are compounded and the translation 'may I have strawberry flavored ice cream?' of the clause 'ICHIGO NO
AISU KUREEMU O ONEGAI SHITAI 'N DESU GA' is thereby generated, and this is supplied to the clause
compounding part 15.
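
The substitution in the converting part 7 and the compounding in the phrase compounding part 14 described above can be illustrated with a minimal Python sketch. The container names, the position-by-position word alignment, the omission of the trailing particle 'O', and the dictionary entry for 'KOUHEE' are assumptions made for illustration only; they are not taken from the specification.

```python
# Hypothetical stand-ins for the noun phrase translation example memory 8
# and the translation dictionary 13 (contents as described for Figs. 10A, 10B).
NOUN_PHRASE_EXAMPLES = {
    ("KOUHEE", "NO", "AISU", "KUREEMU"): "coffee flavored ice cream",
}
# The entry for 'KOUHEE' is an assumption; the text only states 'ICHIGO'.
TRANSLATION_DICTIONARY = {"ICHIGO": "strawberry", "KOUHEE": "coffee"}


def translate_by_substitution(input_words, example_words, example_translation, dictionary):
    """Swap the translation of each differing example word for the
    dictionary translation of the input word in the same position."""
    result = example_translation
    for in_word, ex_word in zip(input_words, example_words):
        if in_word != ex_word:
            result = result.replace(dictionary[ex_word], dictionary[in_word], 1)
    return result


def compound(verb_phrase_translation, noun_phrase_translation):
    """Fill the NP1 slot of a verb phrase translation."""
    return verb_phrase_translation.replace("NP1", noun_phrase_translation)


example_words = ("KOUHEE", "NO", "AISU", "KUREEMU")
noun_phrase = translate_by_substitution(
    ("ICHIGO", "NO", "AISU", "KUREEMU"),
    example_words,
    NOUN_PHRASE_EXAMPLES[example_words],
    TRANSLATION_DICTIONARY,
)
clause = compound("may I have NP1?", noun_phrase)
```

Here `noun_phrase` holds the translation of the noun phrase and `clause` the compounded translation of the clause.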

When the clause compounding part 15 receives the translation 'may I have strawberry flavored ice cream?' of the
clause 'ICHIGO NO AISU KUREEMU O ONEGAI SHITAI 'N DESU GA' and the translation 'X if possible' of the clause
'DEKIMASHITARA', it compounds them. In this way, the second language sentence 'may I have strawberry flavored ice
cream, if possible?', consisting of the first language sentence 'DEKIMASHITARA ICHIGO NO AISU KUREEMU O
ONEGAI SHITAI 'N DESU GA' translated into the second language, is generated.

Next, the method of detection of the example phrases most similar to the input phrases used in the converting part
7 will be described in detail. Now, the words constituting an input phrase I will be expressed, from the beginning of the
phrase, as iw_1, iw_2, ..., iw_N, and their slot-name values will be expressed as if_1, if_2, ..., if_Q. Similarly, the words
constituting an example phrase E will be expressed, from the beginning of the phrase, as ew_1, ew_2, ..., ew_M and
their slot-name values will be expressed as ef_1, ef_2, ..., ef_P. Here, M and N respectively are the numbers of words
constituting the example phrase E and the input phrase I, and P and Q respectively are the numbers of grammatical
attributes of (extracted from) the example phrase E and the input phrase I.

The example most similar (similar in meaning) to an input phrase is the one most suitable to be used in translation
processing of that input phrase. Now, if the probability of a certain example phrase E being suitable for translation of an
input phrase I is called its translation suitability, this translation suitability can be expressed as a conditional probability
of the example phrase as P(Example|Input). Here, P() expresses probability and Example and Input respectively mean
a set of example phrases and a set of input phrases.

The example phrase most similar to an input phrase is that for which this translation suitability P(Example|Input) is
the highest, and when this is written E_max, this example phrase E_max can be obtained from Exp. (2).

$$E_{\max} = \max_{E \in \mathrm{Example}} [P(E \mid I)] \tag{2}$$

The right side of Exp. (2) means an operation obtaining the E for which the value inside the [] is a maximum.
The translation suitability P(Example|Input) can be expressed according to Bayes' law as shown in Exp. (3).

$$P(\mathrm{Example} \mid \mathrm{Input}) = P(\mathrm{Example}) P(\mathrm{Input} \mid \mathrm{Example}) / P(\mathrm{Input}) \tag{3}$$

Here, when finding the example phrase E (E_max) most similar to a certain input phrase I, because there is only one
input phrase I, that is, Input = I, the denominator P(Input) of Exp. (3) is 1 and can be ignored. Therefore, from Exp. (2)
and Exp. (3), it is possible to obtain the example phrase E_max most similar to the input phrase I using the following Exp. (4).

$$E_{\max} = \max_{E \in \mathrm{Example}} [P(E) P(I \mid E)] \tag{4}$$

From Exp. (4), to obtain the example phrase having the greatest translation suitability with respect to the input
phrase, it is necessary to obtain the probabilities P(E) and P(I|E).
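
As a rough illustration of Exp. (4), the following Python sketch selects the stored example that maximizes the product P(E)P(I|E). The function name, the dictionary of priors and the numeric values are hypothetical; the transformation probability is passed in as a callable because its computation is a separate matter.

```python
def most_similar_example(input_phrase, examples, prior, transformation_probability):
    """Return the example E maximizing P(E) * P(I|E), in the manner of Exp. (4).

    prior: dict mapping each example to its priori probability P(E).
    transformation_probability: callable (input_phrase, example) -> P(I|E).
    """
    return max(
        examples,
        key=lambda e: prior[e] * transformation_probability(input_phrase, e),
    )


TANAKA = "TANAKA SAN O ONEGAI SHITAI 'N DESU GA"
KOUHEE = "KOUHEE O IPPAI ONEGAI SHITAI 'N DESU GA"

# Made-up numbers, purely for illustration.
toy_prior = {TANAKA: 0.5, KOUHEE: 0.5}
toy_p_input_given_example = {TANAKA: 0.001, KOUHEE: 0.02}

best = most_similar_example(
    "AISU KUREEMU O ONEGAI SHITAI 'N DESU GA",
    [TANAKA, KOUHEE],
    toy_prior,
    lambda i, e: toy_p_input_given_example[e],
)
```

With these toy values the example containing 'KOUHEE O IPPAI' is selected, since the priors are equal and its assumed transformation probability is larger.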

Here, the probability P(E) represents the priori probability of the example E (E ∈ Example) being used for translation.
The probability (conditional probability) P(I|E) represents the probability (hereinafter for convenience referred to as the
transformation probability) of the example phrase E being transformed into the input phrase I and used.

The priori probability P(E) must correspond to the suitability of the example phrase E being used in any input
phrase, and therefore should for example reflect the following kinds of factor. That is, first it is necessary to reflect in the
priori probability P(E) the factor of what kinds of expression are frequently used in the domain (the type of expression
(writing) inputted into the translating apparatus (for example writing appearing in machine manuals, or newspaper
articles)) in which the translating apparatus is to be applied.

Specifically, example phrases E corresponding to expressions frequently used in the domain must be given higher
priori probabilities P(E) than example phrases E corresponding to expressions only rarely used and expressions having
meaning content having little relation to the domain.

Next, it is necessary to reflect in the priori probability P(E) the factor of what other example phrases are stored in
the translating apparatus besides that example phrase E (in the translating apparatus of Fig. 5, because the example
phrases are divided into noun phrases, verb phrases and other phrases and respectively stored in the noun phrase
translation example memory 8, the verb phrase translation example memory 9 and the other translation example
memory 10, depending on whether the input phrase is a noun phrase, a verb phrase or an other phrase this becomes the
factor of what kinds of example phrases are stored in the noun phrase translation example memory 8, the verb phrase
translation example memory 9 or the other translation example memory 10). Specifically, for example it is necessary to
give to extremely general example phrases E of which there are almost no other similar example phrases a higher priori
probability P(E) than special example phrases and example phrases of which there are many similar example phrases.

A priori probability P(E) in which these kinds of factor are suitably reflected can for example be obtained using
corpus data (training corpus data), which is a set of language expressions typically used in the domain in which the
translating apparatus is to be applied.

That is, for example, accurately selecting from among the example phrases stored in the translating apparatus
those example phrases which are most suitable for translating corpus data is carried out manually or by other means
for all the corpus data. In this case, if the number of example phrases stored in the translating apparatus is expressed
N_E, the total number of times a selection is made from among these example phrases is expressed T and the number
of times an example E_i (i = 1, 2, ..., N_E) is selected is expressed freq(E_i), it is possible to obtain the priori probability
P(E_i) for each example E_i for instance from Exp. (5).
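
Exp. (5) itself does not appear in the text above. The following Python sketch therefore assumes the simplest reading consistent with the definitions of T and freq(E_i), namely a relative-frequency estimate freq(E_i)/T. Both this form and the function name are assumptions for illustration.

```python
from collections import Counter


def priors_from_selections(selections, examples):
    """Estimate P(E_i) as freq(E_i) / T (assumed form of the estimate).

    selections: list of the examples chosen for the corpus data,
                one entry per selection, so T = len(selections).
    examples:   all stored example phrases (N_E of them).
    """
    freq = Counter(selections)
    total = len(selections)  # T
    if total == 0:
        raise ValueError("at least one selection is required")
    return {e: freq[e] / total for e in examples}


# Toy data: three stored examples, four selections.
toy_priors = priors_from_selections(["E1", "E2", "E1", "E1"], ["E1", "E2", "E3"])
```

An example that is never selected receives a probability of zero under this estimate; a practical system might smooth the counts, which is not discussed in the text.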

Here, when selecting the example phrases most suitable for translation of corpus data manually is problematic, it
is possible to obtain approximate values of the priori probability P(E_i) in Exp. (5) by making the translating apparatus
translate the corpus data.

That is, as shown in the flow chart of Fig. 11, first, in a step S21, the priori probabilities P(E_i) are all initialized to for
example 1/N_E. Then, in a step S22, using this priori probability P(E_i), the translating apparatus of Fig. 5 is made to
translate all the corpus data, and at this time the number of times F(E_i) that each example phrase E_i is used is counted.
Then, in a step S23, the priori probabilities P(E_i) are renewed for example according to the expression P(E_i) = F(E_i)/Σ,
where Σ is the summation of F(E_i) over i = 1 to N_E.

After the priori probability P(E_i) has been renewed for all the example phrases, processing proceeds to a step
S24 and detects the example phrase E_i(max) given the greatest priori probability. Processing then proceeds to a step
S25 and determines whether or not the priori probability P(E_i(max)) of the example phrase E_i(max) is smaller than a
predetermined threshold value, and when it is not smaller returns to the step S22 and repeats the steps S22 through S25.
When the priori probability P(E_i(max)) is smaller than the predetermined threshold value, the priori probabilities P(E_i) at that
time are outputted as the priori probabilities for the example phrases E_i and processing ends.
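
The loop of steps S21 through S25 can be sketched in Python as follows. The callable `select_example`, which stands in for the translating apparatus choosing an example for one corpus phrase, and the `max_rounds` safeguard are assumptions added for illustration; neither is part of the described procedure.

```python
def reestimate_priors(examples, corpus, select_example, threshold, max_rounds=100):
    """Approximate P(E_i) by repeatedly translating the corpus (steps S21 to S25).

    select_example: hypothetical callable (phrase, priors) -> chosen example.
    max_rounds:     safeguard against non-termination (an added assumption).
    """
    if not corpus:
        raise ValueError("corpus data must not be empty")
    n = len(examples)
    priors = {e: 1.0 / n for e in examples}            # S21: initialize to 1/N_E
    for _ in range(max_rounds):
        counts = {e: 0 for e in examples}              # S22: count usage F(E_i)
        for phrase in corpus:
            counts[select_example(phrase, priors)] += 1
        total = sum(counts.values())
        priors = {e: counts[e] / total for e in examples}  # S23: renew P(E_i)
        if max(priors.values()) < threshold:           # S24, S25: test the maximum
            break
    return priors


def toy_select(phrase, priors):
    """Toy selector: prior times a made-up likelihood (1.0 on a match, else 0.1)."""
    return max(priors, key=lambda e: priors[e] * (1.0 if e.lower() == phrase else 0.1))


toy_result = reestimate_priors(["A", "B"], ["a", "b", "a"], toy_select, threshold=0.7)
```

In this toy run the first pass already yields priors of 2/3 and 1/3, and since 2/3 is below the threshold of 0.7 the loop stops there.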

When obtaining corpus data is problematic, it is also possible for example to obtain priori probabilities P(E_i) in the
following way using the example phrases stored in the translating apparatus.

That is, if among the words constituting an example phrase E_i for which a priori probability P(E_i) is to be obtained
the number of words that match words constituting another example phrase is expressed W_N and among the
slot-names of the example phrase E_i the number of slot-names had by another example is expressed F_N, the generality of
that example phrase E_i, or its similarity sim-score(E_i) to other examples, can for example be estimated as (W_N + F_N)/(the
number of words constituting the example phrase E_i). In this case, the priori probability P(E_i) can for example be
obtained according to Exp. (6).

[Exp. (6): an expression in sim-score(E_i); not legible in this copy] (6)
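
The similarity score described above, (W_N + F_N) divided by the number of words in the example phrase, can be sketched in Python as below. The word segmentation, the representation of an example as a pair of word list and slot-name list, and the reading of "another example" as any other stored example are assumptions. How the score is turned into a priori probability is left out, since that expression cannot be read from the text.

```python
def sim_score(words, slot_names, other_examples):
    """Compute (W_N + F_N) / (number of words in the example phrase).

    words, slot_names: words and slot-names of the example phrase E_i.
    other_examples:    list of (words, slot_names) pairs for the other examples.
    """
    other_words = {w for ws, _ in other_examples for w in ws}
    other_slots = {s for _, ss in other_examples for s in ss}
    w_n = sum(1 for w in words if w in other_words)       # W_N
    f_n = sum(1 for s in slot_names if s in other_slots)  # F_N
    return (w_n + f_n) / len(words)


# Assumed tokenization of the two verb phrase examples discussed earlier.
kouhee = (["KOUHEE", "O", "IPPAI", "ONEGAI", "SHITAI", "'N", "DESU", "GA"], ["root", "NP1"])
tanaka = (["TANAKA", "SAN", "O", "ONEGAI", "SHITAI", "'N", "DESU", "GA"], ["root", "NP1"])

toy_score = sim_score(kouhee[0], kouhee[1], [tanaka])
```

With this tokenization six of the eight words and both slot-names are shared, giving (6 + 2)/8.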

Next, the transformation probability P(I|E) can be obtained in the following way. When a user expresses the
meaning content of an example phrase E, it can be considered that with a certain high probability the user will use an
expression with words and phrases the same as the example phrase E, with a slightly lower probability the user will use an
expression only slightly different from the example phrase E and with a lower probability the user will use an expression
greatly different from the example phrase E.

Therefore, the input phrase I can be regarded as a variation produced as a result of the expression of the example
phrase E being distorted in the language production process of the user. In this connection, the following kinds of
transforming operators for transforming (distorting) the example phrase E will be introduced.

That is, as operators operating on the words constituting the example phrase E, a word copying operator
echo-word(ew_m, iw_n), a word deleting operator delete-word(ew_m), a word inserting operator add-word(iw_n) and a word replacing
operator alter-word(ew_m, iw_n) will be introduced. Also, as operators operating on the slot-name values of the example
phrase E, a slot-name value copying operator echo-feature(ef_p, if_q), a slot-name value deleting operator
delete-feature(ef_p), a slot-name value inserting operator add-feature(if_q) and a slot-name value replacing operator
alter-feature(ef_p, if_q) will be introduced.

Here, the word copying operator echo-word(ew_m, iw_n) is an operator which without changing it makes the mth (mth
from the beginning) word ew_m of the example phrase E the nth word iw_n of the input phrase I. The word deleting
operator delete-word(ew_m) is an operator which deletes the mth word ew_m of the example phrase E. The word inserting
operator add-word(iw_n) is an operator which inserts (adds) the nth word iw_n of the input phrase I to the example phrase E. The word
replacing operator alter-word(ew_m, iw_n) is an operator which replaces the mth word ew_m of the example phrase E with
the nth word iw_n of the input phrase I (here, the word iw_n is close in meaning to but different from the word ew_m).

The slot-name value copying operator echo-feature(ef_p, if_q) is an operator which without changing it makes the pth
slot-name value ef_p of the example phrase E (in the case of slot-name values, their position in the phrase does not
particularly affect processing and therefore here the order of the slot-name values is a matter of convenience) the qth
slot-name value if_q of the input phrase I. The slot-name value deleting operator delete-feature(ef_p) is an operator which
deletes the pth slot-name value ef_p of the example phrase E. The slot-name value inserting operator add-feature(if_q) is
an operator which inserts the qth slot-name value if_q of the input phrase I into the example phrase E. The slot-name
value replacing operator alter-feature(ef_p, if_q) is an operator which replaces the pth slot-name value ef_p of the example
phrase E with the qth slot-name value if_q of the input phrase I (here, the slot-name value if_q is close in meaning to but
different from the slot-name value ef_p).

Now, if the aforementioned transforming operators as a whole are expressed distort_z and that which is obtained by
Z transforming operators distort_1, distort_2, ..., distort_Z being applied to the example phrase E is the input phrase I,
the input phrase I can be defined as the set of these transforming operators, as shown in Exp. (7).

$$I = \{\mathrm{distort}_1, \mathrm{distort}_2, \ldots, \mathrm{distort}_Z\} \tag{7}$$

There may be more than one way of transforming the example phrase E into the input phrase I, but here it is
sufficient to consider just a transformation following a most probable direct route.
From Exp. (7), the transformation probability P(I|E) can be expressed as shown in Exp. (8).

$$P(I \mid E) = P(\{\mathrm{distort}_1, \mathrm{distort}_2, \ldots, \mathrm{distort}_Z\} \mid E) \tag{8}$$

Here, as shown in Exp. (9), the transforming operators can each be regarded as independent.

$$P(\mathrm{distort}_z \mid \mathrm{distort}_1, \mathrm{distort}_2, \ldots, \mathrm{distort}_Z; E) \approx P(\mathrm{distort}_z \mid E) \tag{9}$$

Therefore, from Exp. (9), Exp. (8) can be simplified as shown in Exp. (10) and as a result the transformation
probability P(I|E) becomes as shown in Exp. (11).

$$P(\{\mathrm{distort}_1, \mathrm{distort}_2, \ldots, \mathrm{distort}_Z\} \mid E) \approx \prod_{z=1}^{Z} P(\mathrm{distort}_z \mid E) \tag{10}$$

$$P(I \mid E) \approx \prod_{z=1}^{Z} P(\mathrm{distort}_z \mid E) \tag{11}$$



Also, because the example phrase E can be expressed as the set of the words ew_m (m = 1, 2, ..., M) and the
slot-name values ef_p (p = 1, 2, ..., P), Exp. (11) becomes as shown in Exp. (12).

$$P(I \mid E) \approx \prod_{z=1}^{Z} P(\mathrm{distort}_z \mid ew_1, ew_2, \ldots, ew_M; ef_1, ef_2, \ldots, ef_P) \tag{12}$$



From Exp. (12), if the probability P(distort_z | ew_1, ew_2, ..., ew_M; ef_1, ef_2, ..., ef_P) is known for the cases wherein
the transforming operator distort_z is respectively the word copying operator echo-word(ew_m, iw_n), the word deleting
operator delete-word(ew_m), the word inserting operator add-word(iw_n), the word replacing operator
alter-word(ew_m, iw_n), the slot-name value copying operator echo-feature(ef_p, if_q), the slot-name value deleting operator
delete-feature(ef_p), the slot-name value inserting operator add-feature(if_q) or the slot-name value replacing operator
alter-feature(ef_p, if_q), the transformation probability P(I|E) can be obtained.

Here, the following assumptions can be made about the transforming operators, and as a result it is possible to
simplify the probability P(distort_z | ew_1, ew_2, ..., ew_M; ef_1, ef_2, ..., ef_P) for each of the transforming operators.

That is, in the case of the word copying operator echo-word(ew_m, iw_n), the probability of the mth word ew_m of the
example phrase E being made the nth word iw_n of the input phrase I without being changed can be assumed to depend
only on these words ew_m, iw_n. Therefore, the probability P(echo-word(ew_m, iw_n) | ew_1, ew_2, ..., ew_M; ef_1, ef_2, ..., ef_P)
can be simplified as shown in Exp. (13).

$$P(\text{echo-word}(ew_m, iw_n) \mid ew_1, ew_2, \ldots, ew_M; ef_1, ef_2, \ldots, ef_P) \approx P(\text{echo-word}(ew_m, iw_n)) \tag{13}$$

Because the probability shown in Exp. (13) is the probability of the word ew_m being as it were copied as a word of the
input phrase I without being changed, hereinafter it will for convenience be referred to as the word copy probability.

In the case of the word deleting operator delete-word(ew_m), it can be assumed that the probability of the mth word
ew_m of the example phrase E being deleted depends only on that word ew_m. Therefore, the probability
P(delete-word(ew_m) | ew_1, ew_2, ..., ew_M; ef_1, ef_2, ..., ef_P) can be simplified as shown in Exp. (14).

$$P(\text{delete-word}(ew_m) \mid ew_1, ew_2, \ldots, ew_M; ef_1, ef_2, \ldots, ef_P) \approx P(\text{delete-word}(ew_m)) \tag{14}$$

Because the probability shown in Exp. (14) is the probability of the word ew_m being deleted, hereinafter it will for
convenience be referred to as the word deletion probability.

In the case of the word inserting operator add-word(iw_n), it can be assumed that the probability of the nth word iw_n
of the input phrase I being inserted into the example phrase E depends only on that word iw_n. Therefore, the probability
P(add-word(iw_n) | ew_1, ew_2, ..., ew_M; ef_1, ef_2, ..., ef_P) can be simplified as shown in Exp. (15).

$$P(\text{add-word}(iw_n) \mid ew_1, ew_2, \ldots, ew_M; ef_1, ef_2, \ldots, ef_P) \approx P(\text{add-word}(iw_n)) \tag{15}$$

Because the probability shown in Exp. (15) is the probability of the word iw_n being inserted (added), hereinafter it will
for convenience be referred to as the word insertion probability (or the word addition probability).

In the case of the word replacing operator alter-word(ew_m, iw_n), it can be assumed that the probability of the mth
word ew_m of the example phrase E being replaced with the nth word iw_n of the input phrase I depends only on those
words ew_m and iw_n. Therefore, the probability P(alter-word(ew_m, iw_n) | ew_1, ew_2, ..., ew_M; ef_1, ef_2, ..., ef_P) can be
simplified as shown in Exp. (16).

$$P(\text{alter-word}(ew_m, iw_n) \mid ew_1, ew_2, \ldots, ew_M; ef_1, ef_2, \ldots, ef_P) \approx P(\text{alter-word}(ew_m, iw_n)) \tag{16}$$

Because the probability shown in Exp. (16) is the probability of the word ew_m being replaced with the word iw_n,
hereinafter it will for convenience be referred to as the word replacement probability.

Also in the cases of the slot-name value copying operator echo-feature(ef_p, if_q), the slot-name value deleting
operator delete-feature(ef_p), the slot-name value inserting operator add-feature(if_q) and the slot-name value replacing
operator alter-feature(ef_p, if_q) expressing copying, deleting, inserting and replacing of slot-name values, the same
assumptions can be made as when the words in the word copying operator echo-word(ew_m, iw_n), the word deleting
operator delete-word(ew_m), the word inserting operator add-word(iw_n) and the word replacing operator
alter-word(ew_m, iw_n) expressing copying, deleting, inserting and replacing of words are replaced with slot-name values.

As a result, the probability of the slot-name value ef_p being copied (hereinafter for convenience referred to as the
slot-name value copy probability) P(echo-feature(ef_p, if_q) | ew_1, ew_2, ..., ew_M; ef_1, ef_2, ..., ef_P), the probability of the
slot-name value ef_p being deleted (hereinafter for convenience referred to as the slot-name value deletion probability)
P(delete-feature(ef_p) | ew_1, ew_2, ..., ew_M; ef_1, ef_2, ..., ef_P), the probability of the slot-name value if_q being inserted
(hereinafter for convenience referred to as the slot-name value insertion probability) P(add-feature(if_q) |
ew_1, ew_2, ..., ew_M; ef_1, ef_2, ..., ef_P) and the probability of the slot-name value ef_p being replaced with the slot-name
value if_q (hereinafter for convenience referred to as the slot-name value replacement probability) P(alter-feature(ef_p, if_q) |
ew_1, ew_2, ..., ew_M; ef_1, ef_2, ..., ef_P) can respectively be simplified as shown in Exps. (17) through (20).

$$P(\text{echo-feature}(ef_p, if_q) \mid ew_1, ew_2, \ldots, ew_M; ef_1, ef_2, \ldots, ef_P) \approx P(\text{echo-feature}(ef_p, if_q)) \tag{17}$$

$$P(\text{delete-feature}(ef_p) \mid ew_1, ew_2, \ldots, ew_M; ef_1, ef_2, \ldots, ef_P) \approx P(\text{delete-feature}(ef_p)) \tag{18}$$

$$P(\text{add-feature}(if_q) \mid ew_1, ew_2, \ldots, ew_M; ef_1, ef_2, \ldots, ef_P) \approx P(\text{add-feature}(if_q)) \tag{19}$$

$$P(\text{alter-feature}(ef_p, if_q) \mid ew_1, ew_2, \ldots, ew_M; ef_1, ef_2, \ldots, ef_P) \approx P(\text{alter-feature}(ef_p, if_q)) \tag{20}$$

From the above, if the word copying operator echo-word(ew_m, iw_n), the word deleting operator delete-word(ew_m),
the word inserting operator add-word(iw_n) and the word replacing operator alter-word(ew_m, iw_n), which are transforming
operators for words (and hereinafter for convenience referred to as word transforming operators), are expressed as a
whole as distort-word_x(ew_m, iw_n) and the slot-name value copying operator echo-feature(ef_p, if_q), the slot-name value
deleting operator delete-feature(ef_p), the slot-name value inserting operator add-feature(if_q) and the slot-name value
replacing operator alter-feature(ef_p, if_q), which are transforming operators for slot-name values (and hereinafter for
convenience referred to as slot-name value transforming operators), are expressed as a whole as distort-feature_y(ef_p, if_q),
the transformation probability P(I|E) expressed in Exp. (11) can be expressed with the product of the transformation
probabilities (hereinafter for convenience referred to as the word transformation probabilities) P(distort-word_x(ew_m, iw_n))
of the word transforming operator distort-word_x(ew_m, iw_n), and the product of the transformation probabilities
(hereinafter for convenience referred to as the slot-name value transformation probabilities) P(distort-feature_y(ef_p, if_q)) of the
slot-name value transforming operator distort-feature_y(ef_p, if_q), as shown in Exp. (21).

$$P(I \mid E) \approx \prod_{x=1}^{X} P(\text{distort-word}_x(ew_m, iw_n)) \prod_{y=1}^{Y} P(\text{distort-feature}_y(ef_p, if_q)) \tag{21}$$

Here, X and Y respectively are the total numbers of word transforming operators and slot-name value transforming
operators applied, and X + Y = Z (Exp. (11)).
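
The factorization of Exp. (21) amounts to multiplying together the probabilities of the individual word and slot-name value transforming operators. A minimal Python sketch follows; the function name and the numeric values are hypothetical, and the operator probabilities are simply passed in as lists.

```python
import math


def transformation_probability(word_operator_probs, feature_operator_probs):
    """P(I|E) as the product of X word-operator probabilities and
    Y slot-name-value-operator probabilities, in the manner of Exp. (21)."""
    return math.prod(word_operator_probs) * math.prod(feature_operator_probs)


# Made-up values: two word operators (X = 2) and one feature operator (Y = 1).
toy_p = transformation_probability([0.9, 0.5], [0.8])
```

For long phrases a practical implementation might sum logarithms instead of multiplying, to avoid numerical underflow; that is an implementation choice rather than something stated in the text. `math.prod` requires Python 3.8 or later.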

Next, the methods by which the word copy probability P(echo-word(ew_m, iw_n)), the word deletion probability
P(delete-word(ew_m)), the word insertion probability P(add-word(iw_n)), the word replacement probability
P(alter-word(ew_m, iw_n)), and the slot-name value copy probability P(echo-feature(ef_p, if_q)), the slot-name value deletion
probability P(delete-feature(ef_p)), the slot-name value insertion probability P(add-feature(if_q)) and the slot-name value
replacement probability P(alter-feature(ef_p, if_q)) are calculated will be described.

First, the word copy probability P(echo-word(ew_m, iw_n)), the word deletion probability P(delete-word(ew_m)) and the
word insertion probability P(add-word(iw_n)) can be regarded as reflecting human language production behavior. That
is, some types of word have a greater or lesser probability of being copied, deleted or inserted than other words.
Therefore, when corpus data of the domain in which the translating apparatus is to be applied is available, it is possible to
compute distributions of the probability of words being copied, deleted or inserted from corpus data and thereby obtain
the word copy probability P(echo-word(ew_m, iw_n)), the word deletion probability P(delete-word(ew_m)) and the word
insertion probability P(add-word(iw_n)).

When on the other hand corpus data is difficult to obtain, on the basis of tendencies of words in language
production (trends in the treatment of words (for example trends such as words tending to be used unchanged or tending to
be deleted or inserted)), it is possible to classify words into a number of groups and give a word copy probability
P(echo-word(ew_m, iw_n)), a word deletion probability P(delete-word(ew_m)) and a word insertion probability
P(add-word(iw_n)) to each of these groups.

In this preferred embodiment, words are classified into the four groups of strong independent words (for example
verbs and nouns and so on); light independent words (for example adjectives and some adverbs (for example 'MATA'
('again') and the like)); functional words (for example particles and conjunctions); and modifiers (for example adverbs
expressing level (for example 'TOTEMO' ('very'))) (also including adjuncts), and constants of the kind shown in Exps. (22)
through (24) are given as the word copy probability P(echo-word(ew_m, iw_n)), the word deletion probability
P(delete-word(ew_m)) and the word insertion probability P(add-word(iw_n)) respectively depending on which group the word to
which the word transforming operator is to be applied belongs to.

$$P(\text{echo-word}(ew_m, iw_n)) \approx \begin{cases}
\text{echo-strong-content} & \text{when } ew_m \text{ is a strong independent word} \\
\text{echo-light-content} & \text{when } ew_m \text{ is a light independent word} \\
\text{echo-grammatical-function} & \text{when } ew_m \text{ is a functional word} \\
\text{echo-modifier} & \text{when } ew_m \text{ is a modifier or an adjunct}
\end{cases} \tag{22}$$

Here, echo-strong-content, echo-light-content, echo-grammatical-function and echo-modifier are predetermined
constants between 0 and 1. The word copy probability P(echo-word(ew_m, iw_n)) is given by Exp. (22) when the words ew_m
and iw_n are the same; when they are not the same, it is made 0 (this is also the same in the case of the slot-name value
copy probability, which corresponds to the word copy probability).

$$P(\text{delete-word}(ew_m)) \approx \begin{cases}
\text{delete-strong-content} & \text{when } ew_m \text{ is a strong independent word} \\
\text{delete-light-content} & \text{when } ew_m \text{ is a light independent word} \\
\text{delete-grammatical-function} & \text{when } ew_m \text{ is a functional word} \\
\text{delete-modifier} & \text{when } ew_m \text{ is a modifier or an adjunct}
\end{cases} \tag{23}$$

Here, delete-strong-content, delete-light-content, delete-grammatical-function and delete-modifier are predetermined
constants between 0 and 1.

$$P(\text{add-word}(iw_n)) \approx \begin{cases}
\text{add-strong-content} & \text{when } iw_n \text{ is a strong independent word} \\
\text{add-light-content} & \text{when } iw_n \text{ is a light independent word} \\
\text{add-grammatical-function} & \text{when } iw_n \text{ is a functional word} \\
\text{add-modifier} & \text{when } iw_n \text{ is a modifier or an adjunct}
\end{cases} \tag{24}$$

Here, add-strong-content, add-light-content, add-grammatical-function and add-modifier are predetermined constants
between 0 and 1.
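
The group-dependent constants of Exps. (22) through (24) can be held in simple lookup tables, as in the Python sketch below. The numeric values are placeholders chosen only for illustration (the text states only that they lie between 0 and 1), and the enum and function names are hypothetical.

```python
from enum import Enum


class WordGroup(Enum):
    STRONG_INDEPENDENT = "strong independent word"
    LIGHT_INDEPENDENT = "light independent word"
    FUNCTIONAL = "functional word"
    MODIFIER = "modifier or adjunct"


# Placeholder constants between 0 and 1; the actual values are not given in the text.
ECHO = {WordGroup.STRONG_INDEPENDENT: 0.9, WordGroup.LIGHT_INDEPENDENT: 0.8,
        WordGroup.FUNCTIONAL: 0.7, WordGroup.MODIFIER: 0.6}
DELETE = {WordGroup.STRONG_INDEPENDENT: 0.01, WordGroup.LIGHT_INDEPENDENT: 0.05,
          WordGroup.FUNCTIONAL: 0.2, WordGroup.MODIFIER: 0.3}
ADD = {WordGroup.STRONG_INDEPENDENT: 0.01, WordGroup.LIGHT_INDEPENDENT: 0.05,
       WordGroup.FUNCTIONAL: 0.2, WordGroup.MODIFIER: 0.3}


def echo_word_probability(ew, iw, group):
    """Word copy probability: the group constant when the words are the same, else 0."""
    return ECHO[group] if ew == iw else 0.0


def delete_word_probability(group):
    """Word deletion probability for a word ew_m of the given group."""
    return DELETE[group]


def add_word_probability(group):
    """Word insertion probability for a word iw_n of the given group."""
    return ADD[group]
```

For example, `echo_word_probability("AISU", "AISU", WordGroup.STRONG_INDEPENDENT)` returns the strong-content constant, while a call with two different words returns 0.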

In this preferred embodiment, in the same way, the slot-name value transformation probabilities, namely the
slot-name value copy probability P(echo-feature(ef_p, if_q)), the slot-name value deletion probability P(delete-feature(ef_p)), the
slot-name value insertion probability P(add-feature(if_q)) and the slot-name value replacement probability
P(alter-feature(ef_p, if_q)), are also given as constants.

Next, concerning the word replacement probability P(alter-word(ew_m, iw_n)), this can be computed on the basis of
the closeness in meaning between the words ew_m and iw_n. That is, if the word iw_n and the word ew_m are synonyms a
high probability can be given as the word replacement probability P(alter-word(ew_m, iw_n)), if the word iw_n and the word
ew_m are similar in meaning a slightly lower probability, and when the word iw_n and the word ew_m are unrelated then a
lower probability can be given to the word replacement probability.

This kind of probability distribution relating to the words ew_m and iw_n can be obtained using corpus data. That is,
using context similarity, when two words are used in similar contexts in the corpus data, the similarity between these
two words can be regarded as being high, and when they are used in different contexts the similarity between two words
can be regarded as being low. The two words having a high similarity can be given a high word replacement probability.

When on the other hand corpus data is difficult to obtain, it is possible to compute word replacement probabilities
for example using a thesaurus of the kind shown in Fig. 6.

Here, in the thesaurus, distances in meaning between two words are reflected as the amount of information
included by the concept (parent node) common to the two words. That is, when the common concept is an extremely
general one like for example 'thing', it is not possible to obtain from that concept information relating to a specific word
that it includes, but when the common concept is a considerably limited concept like for example 'vehicle with engine',
it is possible to obtain from the concept much information relating to specific words that it includes. In other words, the
amount of information included by a given concept (a node in the thesaurus) is inversely proportional to the number of
words included in that concept (the number-dominated).

Therefore, the word replacement probability P(alter-word(ew_m, iw_n)) can be obtained from the amount of information
included in the concept common to both the word ew_m and the word iw_n, that is, on the basis of the number of words
included in that concept in the thesaurus, for example according to Exp. (25).

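$$P(\text{alter-word}(ew_m, iw_n)) \approx \frac{1}{K \cdot \text{number-dominated}(C(ew_m, iw_n))} \tag{25}$$

In the above expression, K is defined as follows:

$$K = \sum_{ew_m \in \text{Vocab}} \; \sum_{iw_n \in \text{Vocab}} \frac{1}{\text{number-dominated}(C(ew_m, iw_n))} \tag{26}$$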



Here, C(ew_m, iw_n) is a concept common to the words ew_m and iw_n, and number-dominated(C(ew_m, iw_n)) is the number
of words included in the concept C(ew_m, iw_n) (words having the concept C(ew_m, iw_n) as a higher concept). Vocab means
the set of words in the thesaurus. Therefore, the first and second Σ on the right side of Exp. (26) are respectively the
summations of the words ew_m and iw_n successively substituted for the words in the thesaurus.
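As an illustration only, the thesaurus-based computation can be sketched in Python. The sketch assumes that Exp. (25) has the form 1 / (K · number-dominated(C(ew_m, iw_n))) and that K is the double sum of 1 / number-dominated over all word pairs; the toy thesaurus, the word list and the function names are hypothetical and are not taken from the specification.

```python
from itertools import product

# Hypothetical toy thesaurus: each concept maps to the set of words it dominates.
THESAURUS = {
    "thing": {"car", "bus", "bicycle", "apple"},
    "vehicle": {"car", "bus", "bicycle"},
    "vehicle with engine": {"car", "bus"},
}
VOCAB = sorted(THESAURUS["thing"])  # the set of words in the thesaurus


def number_dominated(w1, w2):
    """Number of words under the most specific concept common to w1 and w2."""
    return min(len(words) for words in THESAURUS.values()
               if w1 in words and w2 in words)


# Normalising constant K (assumed form of Exp. (26)): double sum over Vocab.
K = sum(1.0 / number_dominated(ew, iw) for ew, iw in product(VOCAB, repeat=2))


def p_alter_word(ew, iw):
    """Word replacement probability in the assumed form of Exp. (25)."""
    return 1.0 / (K * number_dominated(ew, iw))


if __name__ == "__main__":
    for iw in ("bus", "bicycle", "apple"):
        print("car ->", iw, round(p_alter_word("car", iw), 4))
```

Because only the size of the common concept is used, the same code works whether the thesaurus is classified finely or coarsely; a narrower common concept simply yields a higher replacement probability.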

If a method like this, wherein the distance between two words ew_m and iw_n is obtained on the basis of the amount
of information included in a concept common to them, is used, it is possible to accurately obtain the distance (relative
distance) between the words irrespective of the structure of the thesaurus. That is, the thesaurus does not have to be
constructed regularly, as it has been in the related art, and also the distances between concepts do not have to be the
same. Also, the method can be applied both to thesauruses wherein words are classified minutely and to thesauruses
wherein words are classified roughly. It can also be applied to thesauruses wherein classification has been carried out
minutely or roughly only in a part of the thesaurus. Furthermore, the thesaurus may be one dedicated to a certain
specific domain or may be a general one.

The transformation probability P(I|E), as was shown above in Exp. (21), can be obtained as the product of the word
transformation probabilities P(distort-word_x(ew_m, iw_n)) and the product of the slot-name value transformation
probabilities P(distort-word_y(ef_p, if_q)). Of these two products, the product of the slot-name value
transformation probabilities P(distort-word_y(ef_p, if_q)) can be obtained relatively simply as the product of the slot-name
value copy probabilities P(echo-feature(ef_p, if_q)), the slot-name value deletion probabilities P(delete-feature(ef_p)), the
slot-name value insertion probabilities P(add-feature(if_q)) and the slot-name value replacement probabilities
P(alter-feature(ef_p, if_q)).

That is, concerning grammatical attributes, the grammatical attributes of the input phrase (hereinafter for
convenience referred to as the input attributes) and the grammatical attributes of the example phrase (hereinafter for
convenience referred to as the example attributes) are compared, and example attributes the same as input attributes,
example attributes not among the input attributes, input attributes not among the example attributes, and example
attributes which can be replaced with input attributes are detected. When there is an example attribute the same as an
input attribute, an example attribute not among the input attributes, an input attribute not among the example attributes,
or an example attribute which can be replaced with an input attribute, the slot-name value copy probability
P(echo-feature(ef_p, if_q)), the slot-name value deletion probability P(delete-feature(ef_p)), the slot-name value insertion
probability P(add-feature(if_q)) or the slot-name value replacement probability P(alter-feature(ef_p, if_q)) respectively can
be obtained (however, in this preferred embodiment, because as mentioned above these are all given as constants, it is not
necessary to compute them), and by calculating their product it is possible to obtain the product of the slot-name value
transformation probabilities P(distort-word_y(ef_p, if_q)).

Specifically, when for example the input phrase is 'YOYAKU O SHITAI NO DESU GA' and its slot-name values are
(desiderative +) (extended-predicate +) (s-part GA) (formal +) and the example phrase is 'YOYAKU O SHITE ITADAKITAI
NO DESU GA' and its slot-name values are (gerund-suffix ITADAKU) (desiderative +) (extended-predicate +)
(s-part GA) (formal +), the slot-name values (desiderative +), (extended-predicate +), (s-part GA) and (formal +) are
detected as example attributes the same as input attributes and the slot-name value (gerund-suffix ITADAKU) is
detected as an example attribute not among the input attributes.

In this case, by calculating the product of the slot-name value copy probabilities P(echo-feature(ef_p, if_q)) of the
slot-name values (desiderative +), (extended-predicate +), (s-part GA) and (formal +) and the slot-name value deletion
probability P(delete-feature(ef_p)) of the slot-name value (gerund-suffix ITADAKU) it is possible to obtain the product of
the slot-name value transformation probabilities P(distort-word_y(ef_p, if_q)).
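The attribute comparison described above can be sketched as follows. This is a minimal illustration, not the embodiment itself: the four constant values are invented, the function name is hypothetical, and "replaceable" is assumed here to mean "same slot name, different value".

```python
from math import prod

# Assumed constants; the text only states that these are given as constants.
P_ECHO_FEATURE = 0.9     # example attribute also present in the input
P_DELETE_FEATURE = 0.05  # example attribute absent from the input
P_ADD_FEATURE = 0.05     # input attribute absent from the example
P_ALTER_FEATURE = 0.1    # same slot name, different value (assumption)


def slot_transformation_probability(example_attrs, input_attrs):
    """Product of slot-name value copy/delete/insert/replace probabilities.

    Both arguments map slot names to slot values.
    """
    factors = []
    for name, value in example_attrs.items():
        if name not in input_attrs:
            factors.append(P_DELETE_FEATURE)
        elif input_attrs[name] == value:
            factors.append(P_ECHO_FEATURE)
        else:
            factors.append(P_ALTER_FEATURE)
    for name in input_attrs:
        if name not in example_attrs:
            factors.append(P_ADD_FEATURE)
    return prod(factors)


# The example from the text: four shared attributes, one example-only attribute.
input_attrs = {"desiderative": "+", "extended-predicate": "+",
               "s-part": "GA", "formal": "+"}
example_attrs = {"gerund-suffix": "ITADAKU", **input_attrs}
probability = slot_transformation_probability(example_attrs, input_attrs)
```

With these assumed constants the result is four copy factors multiplied by one deletion factor.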

In obtaining the product of the word transformation probabilities P(distort-word_x(ew_m, iw_n)), on the other hand, it is
necessary to find what may be called the most direct transformation route from the example phrase E to the input phrase I
from among a number of transformation routes.

That is, when for example 'KOKO NI 3JI NI DENWA SURU' as an example phrase E has been assigned the
translation 'call here at 3 o'clock', and this example phrase E is used in translation of the input phrase 'RESUTORAN NI
DENWA SURU', for example the following two transformation routes (transformation methods) are conceivable.

Firstly, if with respect to the example phrase 'KOKO NI 3JI NI DENWA SURU', from the first word, 'KOKO' is deleted,
'NI' is deleted, '3JI' is replaced with 'RESUTORAN', 'NI' is copied, 'DENWA' is copied, 'O' is inserted and 'SURU' is
copied, the input phrase 'RESUTORAN NI DENWA SURU' is obtained. Therefore, in this case, in the translation 'call here
at 3 o'clock', 'here' corresponding to 'KOKO NI' is deleted, '3 o'clock' corresponding to '3JI' is replaced with 'the
restaurant' corresponding to 'RESUTORAN', and the translation 'call at the restaurant' is obtained.

Secondly, if with respect to the example phrase 'KOKO NI 3JI NI DENWA SURU', from the first word, 'KOKO' is
replaced with 'RESUTORAN', 'NI' is copied, '3JI' is deleted, 'NI' is deleted, 'DENWA' is copied, 'O' is inserted and
'SURU' is copied, then again the input phrase 'RESUTORAN NI DENWA SURU' is obtained. In this case, in the
translation 'call here at 3 o'clock', 'here' corresponding to 'KOKO NI' is replaced with 'the restaurant' corresponding to
'RESUTORAN NI' and 'at 3 o'clock' corresponding to '3JI NI' is deleted, and the translation 'call the restaurant' is obtained.

In this case, the translation 'call the restaurant', obtained with the second transformation route, is correct, and
therefore in obtaining the word transformation probability P(distort-word_x(ew_m, iw_n)) it is necessary to find the most
direct transformation route (hereinafter for convenience referred to as the optimum route), like this second
transformation route.

To do this, even when the lengths of the example phrase E and the input phrase I are different, i.e. even when the
example phrase E and the input phrase I consist of different numbers of words, it is necessary to recognize
corresponding words and find the best correspondence relationship (alignment) between the example phrase E and the input
phrase I and then find the set of most probably-correct transforming operators (word transforming operators) that
should be applied to transformation of the example phrase E into the input phrase I.

Now, if the set of word transforming operators is written Distort (= {distort-word_1(ew_m, iw_n), distort-word_2(ew_m, iw_n),
..., distort-word_X(ew_m, iw_n)}), the set Distort_max of most probably-correct word transforming operators (hereinafter for
convenience referred to as the optimum word transforming operators) to be applied to transformation of the example
phrase E into the input phrase I can be obtained using Exp. (27).

$$\text{Distort}_{\max} = \max_{\text{Distort}} \left[ P(\text{Distort} \mid E, I) \right] \tag{27}$$

The right side of Exp. (27) means an operation for obtaining the Distort for which the value inside the [ ] is the maximum.

Here, the set of optimum word transforming operators is obtained for example using DP (Dynamic Programming).
That is, first, to obtain from the probability P(Distort|E, I) the distance in DP, its negative log, -log(P(Distort|E, I)), is
considered. Now, writing D = -log(P(Distort|E, I)), by assuming that the operators constituting the word transforming
operator set Distort are independent, D can be simplified as shown in Exp. (28).

$$D = -\log \prod_{x=1}^{X} P(\text{distort-word}_x(ew_m, iw_n) \mid E, I) \tag{28}$$

If it is assumed that the word transforming operators are independent from those words, among the words of the
input phrase I, which they do not affect directly, Exp. (28) can be further simplified as shown in Exp. (29).

$$D = -\log \prod_{x=1}^{X} P(\text{distort-word}_x(ew_m, iw_n) \mid ew_m, iw_n) \tag{29}$$

Dividing Exp. (29) for the individual transforming operators yields Exp. (30).

$$D = \sum_{x=1}^{X} -\log P(\text{distort-word}_x(ew_m, iw_n)) \tag{30}$$

The -log(P(distort-word_x(ew_m, iw_n))) on the right-hand side of Exp. (30) is the distance between the words ew_m and
iw_n when the mth word ew_m of the example phrase and the nth word iw_n of the input phrase are made to correspond,
and this can be seen as directly corresponding to the individual penalties (weights on the path, i.e. costs) used in dynamic
programming equations. Therefore, if distort-word_x(ew_m, iw_n) is rewritten as the word copying operator
echo-word(ew_m, iw_n), the word deleting operator delete-word(ew_m), the word inserting operator add-word(iw_n) and the word
replacing operator alter-word(ew_m, iw_n), and the sum of the distances (the cumulative distance) up to the point where the
mth word ew_m of the example phrase and the nth word iw_n of the input phrase are made to correspond is expressed as G(m, n),
this cumulative distance G(m, n) can be obtained recursively according to the recurrence expression shown in Exp. (31).



$$G(m, n) = \min \begin{cases}
G(m-1, n-1) - \log P(\text{echo-word}(ew_m, iw_n)) \\
G(m, n-1) - \log P(\text{add-word}(iw_n)) \\
G(m-1, n) - \log P(\text{delete-word}(ew_m)) \\
G(m-1, n-1) - \log P(\text{alter-word}(ew_m, iw_n))
\end{cases} \tag{31}$$



For the initial values, suitable values are assigned.

By calculating Exp. (31) until, for example, m = M, it is possible to obtain the distance (the shortest distance)
between the input phrase I and the example phrase E, and the path (DP path) obtained at that time shows the best
(optimum) alignment between the input phrase I and the example phrase E, i.e. the optimum path. The set of word
transforming operators which transform the example phrase E to match the input phrase I according to this optimum path
(alignment) are the optimum word transforming operators.
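The recurrence of Exp. (31) and the recovery of the optimum path can be sketched as follows. This is an illustrative sketch under stated assumptions: the probability values are invented, G(0, 0) = 0 is chosen as the initial value, copying is offered only when the two words are identical and replacement otherwise, and the function name `align` and its parameters are hypothetical.

```python
import math


def align(example, inp, p_echo=0.8, p_add=0.05, p_delete=0.05,
          p_alter=None, p_alter_default=0.01):
    """Return (shortest distance, operator list) transforming example into inp."""
    p_alter = p_alter or {}
    M, N = len(example), len(inp)
    G = [[math.inf] * (N + 1) for _ in range(M + 1)]   # cumulative distances
    back = [[None] * (N + 1) for _ in range(M + 1)]    # operator chosen per cell
    G[0][0] = 0.0                                      # assumed initial value
    for m in range(M + 1):
        for n in range(N + 1):
            if m == 0 and n == 0:
                continue
            candidates = []
            if m > 0 and n > 0:
                ew, iw = example[m - 1], inp[n - 1]
                if ew == iw:
                    cost = -math.log(p_echo)
                    op = ("echo-word", ew, iw)
                else:
                    cost = -math.log(p_alter.get((ew, iw), p_alter_default))
                    op = ("alter-word", ew, iw)
                candidates.append((G[m - 1][n - 1] + cost, op))
            if n > 0:
                candidates.append((G[m][n - 1] - math.log(p_add),
                                   ("add-word", None, inp[n - 1])))
            if m > 0:
                candidates.append((G[m - 1][n] - math.log(p_delete),
                                   ("delete-word", example[m - 1], None)))
            G[m][n], back[m][n] = min(candidates, key=lambda c: c[0])
    # Trace the DP path back from (M, N) to (0, 0).
    operators, m, n = [], M, N
    while (m, n) != (0, 0):
        op = back[m][n]
        operators.append(op)
        if op[0] in ("echo-word", "alter-word"):
            m, n = m - 1, n - 1
        elif op[0] == "add-word":
            n -= 1
        else:
            m -= 1
    operators.reverse()
    return G[M][N], operators


E = "KOKO NI 3JI NI DENWA SURU".split()
I = "RESUTORAN NI DENWA SURU".split()
# Assumed: 'KOKO' and 'RESUTORAN' (both places) are closer than '3JI' and 'RESUTORAN'.
distance, operators = align(E, I, p_alter={("KOKO", "RESUTORAN"): 0.2})
```

With a replacement probability that favours 'KOKO' to 'RESUTORAN', the recovered path starts by replacing 'KOKO', which corresponds to the second transformation route discussed above.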

By adopting the approach described above, it is possible to obtain the set of optimum word transforming operators
even when the lengths of the example phrase E and the input phrase I are different.

Because the probability P(Distort|E, I) is the probability of a set of word transforming operators Distort being used
to collate an input phrase I with an example phrase E and eliminate the differences in expression between the two, the
probability model based on this probability can be called a stochastic expression collating model, and because the
distance between the input phrase I and the example phrase E is obtained on the basis of this stochastic expression
collating model it can be called a stochastic-expression-collating-model-based distance.

After the set of optimum word transforming operators is obtained in the way described above it is possible to obtain
the product of the word transformation probabilities P(distort-word_x(ew_m, iw_n)), and it is then possible to obtain the
transformation probability P(I|E) from this and the product of the slot-name value transformation probabilities
P(distort-word_y(ef_p, if_q)) according to Exp. (21). By calculating the product of this transformation probability P(I|E) and
the priori probability P(E) it is possible to obtain the translation suitability (P(I|E)P(E)), and the example phrase E for
which this translation suitability is the greatest is the example phrase most similar to the input phrase I.

In the converting part 7, the example phrase E most similar to the input phrase I is detected in the way described
above. That is, using a priori probability model and a transformation probability model respectively based on the priori
probability P(E) and the transformation probability P(I|E), the example phrase E most similar to the input phrase I is
detected. In this case, by using the transformation probability model, it is possible to detect the example phrase most
suitable for translating an input phrase without excessively adding, to the example phrases already prepared, example
phrases having the superficial variations, omissions of particles and nouns, hesitations and so on found in so-called
spoken language, and as a result it is possible to increase the quality of translations.

Next, with reference to the flow chart of Fig. 12, the processing in the converting part 7 by which the example
phrase E most similar to the input phrase I is detected will be described further. It will be assumed that priori
probabilities P(E) have been assigned to the example phrases E and are stored in the noun phrase translation example memory
8, the verb phrase translation example memory 9 and the other translation example memory 10. It will be assumed also
that the word copy probability P(echo-word(ew_m, iw_n)), the word deletion probability P(delete-word(ew_m)) and the word
insertion probability P(add-word(iw_n)), and the slot-name value copy probability P(echo-feature(ef_p, if_q)), the slot-name
value deletion probability P(delete-feature(ef_p)), the slot-name value insertion probability P(add-feature(if_q)) and the
slot-name value replacement probability P(alter-feature(ef_p, if_q)) are given as constants and stored in a memory inside
the converting part 7. Also, here, it will be assumed that for example a noun phrase has been inputted into the
converting part 7 as the input phrase and that therefore processing will be carried out using only the example phrases
stored in the noun phrase translation example memory 8. However, if a verb phrase or another phrase is inputted into the
converting part 7 as the input phrase, the only difference is that the example phrases used in the processing become those
stored in either the verb phrase translation example memory 9 or the other translation example memory 10, and the
processing method is the same as in the case of a noun phrase described in the following.

When the converting part 7 receives an input phrase I (here, as mentioned above, a noun phrase), first, in a step
S31, it reads an example phrase (a noun phrase) from the noun phrase translation example memory 8 (hereinafter, this
read-out example phrase E will be called the focus example phrase). Also, in a step S32, the converting part 7
calculates the transformation probability P(I|E) of the focus example phrase E with respect to the input phrase I, and then
proceeds to a step S33. In the step S33, in the converting part 7, the probability P(E) assigned to the focus example
phrase E is read from the noun phrase translation example memory 8, and this and the transformation probability P(I|E)
calculated in the step S32 are multiplied together. In this way, the translation suitability of the focus example phrase E
with respect to the input phrase I is obtained.

After the translation suitability is obtained, proceeding to a step S34, the converting part 7 determines whether or
not calculation of a translation suitability has been carried out for all the example phrases stored in the noun phrase
translation example memory 8. When in the step S34 it is determined that calculation of a translation suitability has not
been carried out for all the example phrases stored in the noun phrase translation example memory 8, processing
returns to the step S31, an example phrase E for which a translation suitability has not yet been calculated is read out
from the noun phrase translation example memory 8, and the processing of the step S32 and the subsequent steps is
repeated with this as the focus example phrase E.

When on the other hand it is determined in the step S34 that calculation of a translation suitability has been carried
out on all the example phrases stored in the noun phrase translation example memory 8, processing proceeds to a step
S35, the example given the greatest translation suitability with respect to the input phrase I is detected in the converting
part 7, and then processing ends.
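The loop of steps S31 to S35 can be sketched as follows. The function names, the toy transformation probability and the priori probabilities are hypothetical stand-ins; in the apparatus the transformation probability P(I|E) would come from the slot-name value and word transformation calculations described in this section.

```python
def most_similar_example(input_phrase, examples, transformation_probability):
    """Return (example, suitability) maximising P(I|E) * P(E).

    examples: iterable of (example_phrase, priori_probability) pairs.
    transformation_probability: callable giving P(I|E) for (input, example).
    """
    best_example, best_suitability = None, -1.0
    for example, prior in examples:                                # step S31
        p_i_given_e = transformation_probability(input_phrase, example)  # S32
        suitability = p_i_given_e * prior                          # step S33
        if suitability > best_suitability:                         # kept for S35
            best_example, best_suitability = example, suitability
    return best_example, best_suitability                          # S34/S35


def toy_transformation_probability(inp, example):
    """Stand-in for P(I|E): halves for every word not shared by both phrases."""
    return 0.5 ** len(set(inp) ^ set(example))


examples = [
    (("KOKO", "NI", "DENWA", "SURU"), 0.5),
    (("RESUTORAN", "NI", "IKU"), 0.5),
]
input_phrase = ("RESUTORAN", "NI", "DENWA", "SURU")
best, suitability = most_similar_example(
    input_phrase, examples, toy_transformation_probability)
```

Passing the probability model in as a callable keeps the selection loop independent of how P(I|E) is actually computed.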

Next, the transformation probability calculation processing of the step S32 of Fig. 12 will be described with
reference to the flow charts of Fig. 13 and Fig. 14.

First, Fig. 13 is a flow chart illustrating the slot-name value transformation probability calculation processing. The
converting part 7 first in a step S41 compares the input phrase I to the focus example phrase E and obtains slot-name
value copy probabilities, slot-name value deletion probabilities, slot-name value insertion probabilities and slot-name
value replacement probabilities. Then, in a step S42, the converting part 7 obtains a slot-name value transformation
probability by multiplying these together, whereupon processing ends.

Fig. 14 is a flow chart illustrating the word transformation probability calculation processing. The converting part 7
first in a step S51 obtains the cumulative distance G(m, n) between the input phrase I and the focus example phrase E
according to Exp. (31) and then proceeds to a step S52 and determines whether or not m is less than M (the number
of words constituting the focus example phrase E). When in the step S52 it is determined that m is less than M,
processing returns to the step S51 and the converting part 7 increases m and calculates Exp. (31) while changing n. Then, in
the step S52, when it is determined that m is not less than M, i.e. when m is equal to M, the path by which the cumulative
distance G(m, n) was obtained, i.e. the optimum path, is detected and processing proceeds to a step S53.

In the step S53, in the converting part 7, the word transformation probability is calculated following the obtained
optimum path. That is, word copy probabilities, word deletion probabilities, word insertion probabilities and word
replacement probabilities are obtained, and by these being multiplied together the word transformation probability is
obtained. With respect to the word replacement probabilities, the converting part 7 controls the similarity degree
calculating part 11 to calculate them according to Exp. (25). That is, the similarity degree calculating part 11 obtains the
number of words included in a concept in the thesaurus stored in the thesaurus memory 12 common to a word of the input
phrase I and a word constituting the focus example phrase E, and on the basis of that number of words obtains a word
similarity degree expressing the similarity in meaning between the two words, i.e. the word replacement probability
obtained by calculating Exp. (25).

Next, Fig. 15 shows the construction of another preferred embodiment of a translating apparatus according to the
invention. Parts in Fig. 15 corresponding to parts shown in Fig. 5 have been given the same reference numerals and
will not be described in the following. That is, except in that the converting part 7 has a cache 7A, this translating
apparatus is basically constructed in the same way as that shown in Fig. 5.

The cache 7A temporarily stores provisionally deleted words and provisionally added words. The cache 7A is called
a 'cache' only because it stores information temporarily like a cache memory disposed between a CPU and main
memory, and it does not necessarily have to be a high-speed memory like an ordinary cache memory.

In the preferred embodiment described above, as transforming operators operating on words constituting an
example phrase E, the word copying operator echo-word(ew_m, iw_n), the word deleting operator delete-word(ew_m), the word
inserting operator add-word(iw_n) and the word replacing operator alter-word(ew_m, iw_n) were used, but with these four
operators it is difficult to deal with cases where the example phrase E is transformed into the input phrase I just by the
positions of the words of the example phrase E being changed (for example when the example phrase E is 'WATASHI
WA MIKAN O TABETAI' ('I want to eat an orange') and is to be transformed into the input phrase I 'MIKAN O WATASHI
WA TABETAI' ('an orange - I want to eat')) and cases where the example phrase E is to be transformed into the input
phrase I by words of the example phrase E being replaced with words of the input phrase I and also having their
positions changed (for example when the example phrase E is 'WATASHI WA MIKAN O TABETAI' ('I want to eat an orange')
and is to be transformed into the input phrase I 'RINGO O WATASHI WA TABETAI' ('an apple - I want to eat')).

Also, in cases where there is included in the input phrase I a compound word (for example a compound noun)
consisting of a word used as a modifier among the words of the input phrase I added to a word of the example phrase E
(for example, when the example phrase E is 'DENWA O MITSUKERU' ('find a telephone') and the input phrase I is
'KOUSHUUDENWA O MITSUKERU' ('find a public telephone')) and cases where there is included in the input phrase I
a compound word consisting of a word used as a modifier among the words of the input phrase I added to a word of the
example phrase E having been replaced with a word of the input phrase I (for example, when the example phrase E is
'DENWA O MITSUKERU' ('find a telephone') and the input phrase I is 'KOUSHUUTOIRE O MITSUKERU' ('find a public
toilet')) it is necessary to make a word of the example phrase E correspond with a compound word of the input phrase
I (in the above examples, 'DENWA' and 'KOUSHUUDENWA' ('telephone' and 'public telephone'), and 'DENWA' and
'KOUSHUUTOIRE' ('telephone' and 'public toilet')), but this is difficult to do with the four operators mentioned above.

To overcome this, instead of the word deleting operator delete-word(ew_m), which actually deletes the word ew_m of
the example phrase E (an operator which irreversibly determines deletion at the time of its operation), a provisionally
deleting operator, which is an operator for deleting the word ew_m provisionally, will be introduced. Because this
provisionally deleting operator temporarily removes (stores) the word being provisionally deleted (hereinafter referred to as
the provisionally deleted word) ew_m to the cache 7A, it will be written cache-as-deleted(ew_m).

Also, instead of the word inserting operator add-word(iw_n), which actually inserts the word iw_n of the input phrase
into the example phrase E (irreversibly determines insertion at that time), a provisionally inserting operator, which is an
operator for inserting that word iw_n into the example phrase E provisionally, will be introduced. Because this
provisionally inserting operator temporarily removes the word iw_n being provisionally inserted (hereinafter referred to as the
provisionally inserted word) to the cache 7A, it will be written cache-as-added(iw_n).

Also, a scrambling operator scramble(ew_m, iw_n) will be introduced. This is an operator for copying unchanged a word
ew_m of the example phrase E as a word iw_n which among the words of the input phrase I is not the word aligned with the
word ew_m, or for replacing the word ew_m with such a word iw_n; i.e. it changes only the position of a word of the example
phrase E, or replaces a word of the example phrase E with a word of the input phrase I and also changes its position.

Also, a compounding operator compound(iw_{n-1}, iw_n), which is an operator for identifying two consecutive words
iw_{n-1} and iw_n of the input phrase I as constituting a compound word when those two words are a modifier and a modified
word, will be introduced. In this case, an operator for copying unchanged a word ew_m of the example phrase E as a word
iw_n constituting the input phrase I and making it form a compound word with the word iw_{n-1} one word in front of
it can be expressed as the combination of the word copying operator echo-word(ew_m, iw_n) and the compounding
operator compound(iw_{n-1}, iw_n) (hereinafter for convenience referred to as the copying and compounding operator). Also, an
operator for replacing a word ew_m of the example phrase E with a word iw_n of the input phrase I and then making it form
a compound word with the word iw_{n-1} one word in front of it can be expressed as the combination of the word replacing
operator alter-word(ew_m, iw_n) and the compounding operator compound(iw_{n-1}, iw_n) (hereinafter for convenience referred
to as the replacing and compounding operator).

In the preferred embodiment shown in Fig. 15, as operators operating on the words of the example phrase E, the
six operators that are the word copying operator echo-word(ew_m, iw_n), the word replacing operator alter-word(ew_m, iw_n),
the provisionally deleting operator cache-as-deleted(ew_m), the provisionally inserting operator cache-as-added(iw_n),
the scrambling operator scramble(ew_m, iw_n) and the compounding operator compound(iw_{n-1}, iw_n) are employed. Of
these six operators, only the compounding operator compound(iw_{n-1}, iw_n) is not used alone; as described above it is
used as the copying and compounding operator or the replacing and compounding operator in combination with the
word copying operator echo-word(ew_m, iw_n) or the word replacing operator alter-word(ew_m, iw_n).

Of the newly introduced operators, the provisionally deleting operator cache-as-deleted(ew_m) and the provisionally
inserting operator cache-as-added(iw_n) can be used in the same ways as the word deleting operator delete-word(ew_m)
and the word inserting operator add-word(iw_n) respectively (although, in that they provisionally delete or insert
words by storing the word in the cache 7A, they differ from the word deleting operator delete-word(ew_m) and the word
inserting operator add-word(iw_n), which irreversibly determine deletion or insertion of the word), but in the use of the
scrambling operator scramble(ew_m, iw_n), the copying and compounding operator and the replacing and compounding
operator there are the following kinds of limitation.

That is, transforming an example phrase E into an input phrase I by changing just the position of a word of the
example phrase E, or by replacing a word of the example phrase E with a word of the input phrase I and then changing its
position, can be realized either by deleting the word whose position is to be changed and then inserting it in a
predetermined position or by inserting the word whose position is to be changed in the predetermined position and then deleting
the word in the original position.

Therefore, when transforming an example phrase E into an input phrase I, to change the position of a word it is
necessary for that word to have been deleted from the example phrase E or inserted into the input phrase I in the past.
Because of this, the scrambling operator scramble(ew_m, iw_n) is only used when a word ew_m of the example phrase E
has been provisionally deleted or when a word iw_n of the input phrase I has been provisionally inserted, i.e. when a
provisionally deleted word ew_m or a provisionally inserted word iw_n has been stored in the cache 7A.

Specifically, in calculating an optimum path, in a case where the transformation path from the mth word ew_m of the
example phrase E to the nth word iw_n of the input phrase I is at issue, when a provisionally deleted word ew_k (k<m)
is stored in the cache 7A, it is possible to use the scrambling operator scramble(ew_k, iw_n) (the nth word iw_n of the input
phrase I can be deemed to have been made not by transforming the mth word ew_m of the example phrase E but by a
transformation changing the position of the kth word ew_k of the example phrase E, without changing the word or after
replacing the word).

Also, in the case mentioned above, when a provisionally inserted word iw_s (s<n) is stored in the cache 7A, it
is likewise possible to use the scrambling operator scramble(ew_m, iw_s) (the mth word ew_m of the example phrase E can be
deemed not to have been transformed into the nth word iw_n of the input phrase I but to have been transformed into the
sth word iw_s of the input phrase I, either without being changed or after being replaced, by having its position changed).

When the scrambling operator scramble(ew_k, iw_n) or the scrambling operator scramble(ew_m, iw_s) has been used
(when in the process of calculating the optimum path a transformation path corresponding to one of these has been
selected), because at that time it is finally determined that the provisionally deleted word ew_k or the provisionally
inserted word iw_s has not been deleted or inserted, that is, it is finally determined that its position has been changed, it
is erased from the cache 7A.

Next, concerning the copying and compounding operator and the replacing and compounding operator, there are 
the following kinds of limitation on their use. 

That is, to copy without changing it a word ew_m of the example phrase E as a word iw_n of the input phrase I and
make it form a compound word with the word iw_{n-1} one word in front of it, or to replace a word ew_m of the example
phrase E with a word iw_n of the input phrase I and make it form a compound word with the word iw_{n-1} one word in front
of it, it is necessary for the word iw_{n-1} serving as a modifier to have in the past been inserted into the input phrase I.
Consequently, the copying and compounding operator and the replacing and compounding operator are used only
when a word iw_{n-1} of the input phrase I has been provisionally inserted in the past, i.e. when the provisionally inserted
word iw_{n-1} is stored in the cache 7A.

Specifically, in calculating the optimum path, when the transformation path from the mth word ew_m of the example
phrase E to the nth word iw_n of the input phrase I is at issue, the copying and compounding operator and the replacing
and compounding operator can be used when the provisionally inserted word iw_{n-1} has been stored in the cache 7A,
i.e. when the word iw_{n-1} has been provisionally inserted in front of the word iw_n.
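The limitations on the scrambling operator and on the two compounding combinations can be summarized in a small sketch. The `Cache` class, its index-based contents and the function name are assumptions made for illustration; the specification only states that provisionally deleted and provisionally inserted words are held in the cache 7A.

```python
from dataclasses import dataclass, field


@dataclass
class Cache:
    """Hypothetical stand-in for the cache 7A, storing word positions."""
    deleted: set = field(default_factory=set)  # indices k of provisionally deleted ew_k
    added: set = field(default_factory=set)    # indices s of provisionally inserted iw_s


def available_operators(cache, m, n):
    """Operators usable when aligning example word ew_m with input word iw_n."""
    # These four are always usable (copy applies when the words are the same).
    ops = ["echo-word", "alter-word", "cache-as-deleted", "cache-as-added"]
    # scramble needs an earlier provisionally deleted ew_k (k < m)
    # or an earlier provisionally inserted iw_s (s < n) in the cache.
    if any(k < m for k in cache.deleted) or any(s < n for s in cache.added):
        ops.append("scramble")
    # Copying/replacing and compounding needs iw_(n-1) provisionally inserted.
    if (n - 1) in cache.added:
        ops += ["echo-word+compound", "alter-word+compound"]
    return ops


# 'KOUSHUU' (iw_1) was provisionally inserted before 'DENWA' (iw_2) is aligned.
ops_after_insertion = available_operators(Cache(added={1}), m=1, n=2)
```

An empty cache therefore leaves only the four basic operators available.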

When the copying and compounding operator or the replacing and compounding operator has been used, because
at that point in time it is finally determined that the provisionally inserted word iw_{n-1} has not been inserted, i.e. it is finally
determined that it constitutes a compound word, it is erased from the cache 7A.

Next, to obtain the word transformation probabilities when transformation of words is carried out using these six
transforming operators (the word copying operator echo-word(ew_m, iw_n), the word replacing operator
alter-word(ew_m, iw_n), the provisionally deleting operator cache-as-deleted(ew_m), the provisionally inserting operator
cache-as-added(iw_n), the scrambling operator scramble(ew_m, iw_n) and the compounding operator compound(iw_{n-1}, iw_n)),
probabilities of the transforming operators (probabilities of transformations using the transforming operators being carried
out) are necessary, and these can be defined for example as follows.

That is, first, the probability (the word copying probability) P(echo-word(ew_m, iw_n)) for the word copying operator
echo-word(ew_m, iw_n) and the probability (the word replacement probability) P(alter-word(ew_m, iw_n)) for the word
replacing operator alter-word(ew_m, iw_n) are given by Exp. (22) and Exp. (25) respectively, as in the preferred embodiment
described above.

Next, the probability (hereinafter for convenience referred to as the provisional deletion probability)
P(cache-as-deleted(ew_m)) for the provisionally deleting operator cache-as-deleted(ew_m) is given for example using the
word deletion probability P(delete-word(ew_m)) and the word copy probability P(echo-word(ew_m, iw_n)) so as to satisfy
the following expression.

P(delete-word(ew_m)) < P(cache-as-deleted(ew_m)) < P(echo-word(ew_m, iw_n))    (32)

Here, the provisional deletion probability P(cache-as-deleted(ew_m)) has a value satisfying Exp. (32) and for example
lower than the word replacement probability P(alter-word(ew_m, iw_n)) for a similar word and higher than the word
replacement probability P(alter-word(ew_m, iw_n)) for a dissimilar word.

The probability for the provisionally inserting operator cache-as-added(iw_n) (hereinafter for convenience referred to
as the provisional insertion probability) P(cache-as-added(iw_n)) is given for example using the word insertion probability
P(add-word(iw_n)) and the word copy probability P(echo-word(ew_m, iw_n)) so as to satisfy the following expression.

P(add-word(iw_n)) < P(cache-as-added(iw_n)) < P(echo-word(ew_m, iw_n))    (33)

Here, the provisional insertion probability P(cache-as-added(iw_n)) has a value satisfying Exp. (33) and for example
lower than the word replacement probability P(alter-word(ew_m, iw_n)) for a similar word and higher than the word
replacement probability P(alter-word(ew_m, iw_n)) for a dissimilar word.

Also, when the word ew_m is the same as the word iw_n, the word copy probability P(echo-word(ew_m, iw_n)) in Exps.
(32) and (33) is given by Exp. (22). When the words ew_m and iw_n are not the same, the word copy probability
P(echo-word(ew_m, iw_n)) becomes 0, as mentioned above; in this case, the upper limit of the provisional deletion
probability P(cache-as-deleted(ew_m)) and the provisional insertion probability P(cache-as-added(iw_n)) is not limited by
the word copy probability P(echo-word(ew_m, iw_n)).
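The constraints of Exps. (32) and (33) can be illustrated with a minimal sketch. The function name, the argument names and the choice of the midpoint are assumptions made here for illustration only; the description requires only that the value lie within the stated bounds.

```python
def provisional_probability(p_final, p_echo, p_alter_dissimilar, p_alter_similar):
    """Pick a provisional deletion or insertion probability (hypothetical helper).

    p_final: P(delete-word) or P(add-word), the lower bound in Exp. (32)/(33).
    p_echo:  P(echo-word), the upper bound; 0 means the two words differ,
             in which case the upper bound is not applied.
    The result is also kept above the replacement probability for a
    dissimilar word and below that for a similar word.
    """
    low = max(p_final, p_alter_dissimilar)
    high = p_alter_similar if p_echo == 0 else min(p_echo, p_alter_similar)
    if not low < high:
        raise ValueError("no value satisfies the constraints")
    # The midpoint is an arbitrary illustrative choice, not taken from the source.
    return (low + high) / 2


p = provisional_probability(p_final=0.05, p_echo=0.9,
                            p_alter_dissimilar=0.1, p_alter_similar=0.6)
```

Any other rule that returns a value strictly inside the interval would serve equally well for the purposes of Exps. (32) and (33).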

The probability P(scramble(ew_m, iw_n)) (hereinafter for convenience referred to as the scramble probability) for the
scrambling operator scramble(ew_m, iw_n) can be given by the following expression.

P(scramble(ew_m, iw_n)) = f(P(alter-word(ew_m, iw_n)), iw_n)   when ew_m ≠ iw_n
P(scramble(ew_m, iw_n)) = f(P(echo-word(ew_m, iw_n)), iw_n)    when ew_m = iw_n    (35)

In Exp. (35), the function f() obtains the likelihood of occurrence of a phenomenon (scrambling) of the order of
words or phrases and so on changing, and can be set taking into account for example the nature of the word stored in
the cache 7A (for example its part of speech), the language of the input phrase I and the situation in which the input
phrase I is being used. That is, the function f() can be set so as to give a higher probability with respect to input phrases
I of Japanese, in which word order changes are freer than in English and the like, than to input phrases I of English. The
function f() can also be set so as to give a higher probability with respect to spoken language, in which word order
changes such as inversions are more likely to occur than in written language.
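One way to picture the function f() of Exp. (35) is as a scaling of the base probability by factors for language, register and part of speech. The sketch below is only an assumption about how such a function might be set: the factor tables, their values and the names used are hypothetical and do not appear in the source.

```python
# Hypothetical factor tables; the values are illustrative only.
LANGUAGE_FACTOR = {"japanese": 0.5, "english": 0.2}   # freer word order scores higher
REGISTER_FACTOR = {"spoken": 1.0, "written": 0.6}     # inversions are likelier in speech
POS_FACTOR = {"verb": 0.1}                            # other parts of speech default to 1.0


def base_probability(same_word, p_echo, p_alter):
    """First argument of f() in Exp. (35): copy probability if ew_m = iw_n, else replacement."""
    return p_echo if same_word else p_alter


def scramble_probability(base, pos, language, register):
    """A possible f(): scale the base probability by the three factors."""
    return base * LANGUAGE_FACTOR[language] * REGISTER_FACTOR[register] * POS_FACTOR.get(pos, 1.0)


base = base_probability(same_word=True, p_echo=0.9, p_alter=0.3)
p_ja = scramble_probability(base, "noun", "japanese", "spoken")
p_en = scramble_probability(base, "noun", "english", "spoken")
```

With these assumed tables a Japanese input phrase receives a higher scramble probability than an English one, and spoken input a higher one than written input, as the description suggests.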

In Exp. (35), as the argument of the function f(), when ew_m ≠ iw_n, the word replacement probability
P(alter-word(ew_m, iw_n)) is used; this is because when the words ew_m and iw_n are not the same (when ew_m ≠ iw_n), it
can be considered that the word of the example phrase E has been replaced and the word order has been changed, and
when the words ew_m and iw_n are the same (when ew_m = iw_n) it can be considered that the word of the example phrase E
has been copied without being changed and only the word order has been changed.

In Exp. (35), in both the cases of ew_m ≠ iw_n and ew_m = iw_n, as the argument of the function f(), the word iw_n of the
input phrase I is used; this is because it can be supposed that the likelihood of a word order change varies depending
on the word iw_n. That is, for example in Japanese there are almost no cases where a verb is moved to the beginning,
and therefore when the word iw_n is a verb it takes a value such that the scramble probability P(scramble(ew_m, iw_n)) is
small.

Next, the probability (hereinafter for convenience referred to as the compound probability) P(compound(iw_{n-1}, iw_n))
is for example given by Exp. (36).



In Exp. (36), the function g(iw_{n-1}, iw_n) expresses the probability of the consecutive words iw_{n-1} and iw_n forming a
compound word, and this can be calculated for example in the following way. That is, it is possible for example to
prestore numerous examples of compound words and find the compound word most similar to the combination of the
words iw_{n-1}, iw_n and normalize its degree of similarity to obtain a probability of them forming a compound word.

When the degree of similarity between the combination and the stored compound word example is large enough,
the compound probability P(compound(iw_{n-1}, iw_n)) is higher than the word insertion probability P(add-word(iw_n)).
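The calculation of g(iw_{n-1}, iw_n) from prestored compound word examples can be sketched as follows. This is a minimal illustration under stated assumptions: the stored examples are hypothetical, and a character-level similarity from Python's difflib stands in for whatever degree of similarity the apparatus actually uses (the ratio is already normalized to the range 0 to 1).

```python
from difflib import SequenceMatcher

# Hypothetical prestored compound word examples.
COMPOUND_EXAMPLES = ["machine translation", "speech recognition", "language model"]


def compound_probability(prev_word, word, examples=COMPOUND_EXAMPLES):
    """Sketch of g(iw_{n-1}, iw_n): similarity to the most similar stored compound."""
    candidate = f"{prev_word} {word}"
    # SequenceMatcher.ratio() returns a value in [0, 1], used here as the
    # normalized degree of similarity.
    return max(SequenceMatcher(None, candidate, ex).ratio() for ex in examples)


p_known = compound_probability("machine", "translation")
p_other = compound_probability("blue", "sky")
```

A combination that exactly matches a stored compound receives the highest value, so with a suitable word insertion probability the compounding path is preferred over treating the modifier as an inserted word.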

The probability (hereinafter for convenience referred to as the copy and compound probability) of a word ew_m of the
example phrase E being copied as a word iw_n of the input phrase I without being changed and forming a compound
word together with the word in front of it is the product of the word copy probability P(echo-word(ew_m, iw_n)) and the
compound probability P(compound(iw_{n-1}, iw_n)), and the probability (hereinafter for convenience referred to as the
replace and compound probability) of a word ew_m of the example phrase E being replaced with a word iw_n of the input
phrase I and forming a compound word together with the word in front of it is the product of the word replacement
probability P(alter-word(ew_m, iw_n)) and the compound probability P(compound(iw_{n-1}, iw_n)).

With the six transforming operators of the word copying operator echo-word(ew_m, iw_n), the word replacing operator
alter-word(ew_m, iw_n), the provisionally deleting operator cache-as-deleted(ew_m), the provisionally inserting operator
cache-as-added(iw_n), the scrambling operator scramble(ew_m, iw_n) and the compounding operator
compound(iw_{n-1}, iw_n), it is possible to carry out the following eight kinds of transformation.

That is, firstly, with the word copying operator echo-word(ew_m, iw_n), it is possible to make the word ew_m of the
example phrase E the word iw_n of the input phrase I without changing it. Secondly, with the word replacing operator
alter-word(ew_m, iw_n), it is possible to replace the word ew_m of the example phrase E with the word iw_n of the input
phrase I. Thirdly, with the provisionally deleting operator cache-as-deleted(ew_m), it is possible to provisionally delete the
word ew_m of the example phrase E, and fourthly, with the provisionally inserting operator cache-as-added(iw_n), it is
possible to provisionally insert the word iw_n of the input phrase I.

Also, with the scrambling operator scramble(ew_m, iw_n), it is possible to carry out the following two word order
changes. That is, fifthly, with the scrambling operator scramble(ew_k, iw_n), it is possible to change the word order by
inserting a provisionally deleted word ew_k stored in the cache 7A as the nth word iw_n of the input phrase I, and sixthly,
with the scrambling operator scramble(ew_m, iw_s), it is possible to treat a provisionally inserted word iw_s as the mth
word ew_m of the example phrase E moved to a different position in the word order.

Seventhly, with the combination (copying and compounding operator) of the word copying operator
echo-word(ew_m, iw_n) and the compounding operator compound(iw_{n-1}, iw_n), it is possible without changing it to copy
the word ew_m of the example phrase E as the word iw_n of the input phrase I and make it form a compound word with the
word iw_{n-1} one word in front of it, and eighthly, with the combination (replacing and compounding operator) of the word
replacing operator alter-word(ew_m, iw_n) and the compounding operator compound(iw_{n-1}, iw_n), it is possible to replace
the word ew_m of the example phrase E with the word iw_n of the input phrase I and make it form a compound word with
the word iw_{n-1} one word in front of it.

The optimum set of transforming operators able to carry out these eight transformations, i.e. the optimum path,
can be found by means of DP matching, for example by calculating a cumulative distance G(m, n) using weightings
of paths corresponding to the eight transformations as shown in Exp. (37), similar to the case in Exp. (31).



P(compound(iw_{n-1}, iw_n)) = g(iw_{n-1}, iw_n)    (36)

G(m, n) = min {
    G(m-1, n-1) - log P(echo-word(ew_m, iw_n)),
    G(m-1, n-1) - log P(alter-word(ew_m, iw_n)),
    G(m-1, n)   - log P(cache-as-deleted(ew_m)),
    G(m, n-1)   - log P(cache-as-added(iw_n)),
    G(m, n-1)   - log P(scramble(ew_k, iw_n)),
    G(m-1, n)   - log P(scramble(ew_m, iw_s)),
    G(m-1, n-1) - log P(echo-word(ew_m, iw_n)) - log P(compound(iw_{n-1}, iw_n)),
    G(m-1, n-1) - log P(alter-word(ew_m, iw_n)) - log P(compound(iw_{n-1}, iw_n))
}    (37)
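The recurrence of Exp. (37) can be sketched for a single cell as follows. This is an illustrative reading of the expression, not the implementation of the converting part 7: the function name, the dictionary keys and the convention that an inapplicable operator is simply left out of the dictionary (for example, compounding when the cache 7A holds no provisionally inserted word) are assumptions.

```python
import math


def cell_distance(g_diag, g_up, g_left, probs):
    """Return (G(m, n), chosen path) for one cell of Exp. (37).

    g_diag = G(m-1, n-1), g_up = G(m-1, n), g_left = G(m, n-1).
    probs maps operator names to probabilities; a missing or zero entry
    makes the corresponding path unavailable (infinite cost).
    """
    def cost(name):
        p = probs.get(name, 0.0)
        return -math.log(p) if p > 0 else math.inf

    candidates = {
        "echo-word": g_diag + cost("echo-word"),
        "alter-word": g_diag + cost("alter-word"),
        "cache-as-deleted": g_up + cost("cache-as-deleted"),
        "cache-as-added": g_left + cost("cache-as-added"),
        "scramble(ew_k, iw_n)": g_left + cost("scramble-deleted"),
        "scramble(ew_m, iw_s)": g_up + cost("scramble-added"),
        "echo-word+compound": g_diag + cost("echo-word") + cost("compound"),
        "alter-word+compound": g_diag + cost("alter-word") + cost("compound"),
    }
    best = min(candidates, key=candidates.get)
    return candidates[best], best


distance, path = cell_distance(0.0, 0.0, 0.0, {
    "echo-word": 0.9, "cache-as-deleted": 0.3, "cache-as-added": 0.3,
})
```

Because the cumulative distance is a sum of negative logarithms, the two compound paths add the cost of the compounding operator to the cost of copying or replacing, which corresponds to multiplying the two probabilities.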



In the preferred embodiment shown in Fig. 15, in the converting part 7, by calculating Exp. (37) until m=M, using
the cache 7A as necessary, an optimum path is found and a word transformation probability is calculated.

That is, as shown in the flow chart of Fig. 16, in the converting part 7, first in a step S61, according to Exp. (37),
the cumulative distance G(m, n) of when the word ew_m constituting a focus example phrase E is made to correspond
with the word iw_n of the input phrase I is found.

Then, proceeding to a step S62, it is determined whether or not in the calculation of the cumulative distance G(m, n)
in the step S61 the path corresponding to the provisionally deleting operator cache-as-deleted(ew_m), i.e. the third
expression from the top on the right hand side in Exp. (37), was selected, and when it is determined that it was selected
processing proceeds to a step S63. In the step S63, the word ew_m provisionally deleted by the provisionally deleting
operator cache-as-deleted(ew_m) is removed to (stored in) the cache 7A and processing proceeds to a step S64.

When in the step S62 it is determined that the path corresponding to the provisionally deleting operator
cache-as-deleted(ew_m) was not selected, the step S63 is skipped and processing proceeds to the step S64, wherein it is
determined whether or not in the calculation of the cumulative distance G(m, n) the path corresponding to the
provisionally inserting operator cache-as-added(iw_n), i.e. the fourth expression from the top on the right hand side in
Exp. (37), was selected. When in the step S64 it is determined that the path corresponding to the provisionally inserting
operator cache-as-added(iw_n) was selected, processing proceeds to a step S65 and the word iw_n provisionally inserted
by the provisionally inserting operator cache-as-added(iw_n) is removed to the cache 7A, and processing proceeds to a
step S66.

When in the step S64 it is determined that the path corresponding to the provisionally inserting operator
cache-as-added(iw_n) was not selected, processing skips the step S65 and proceeds to a step S66, wherein it is determined
whether or not in the calculation of the cumulative distance G(m, n) a path corresponding to the copying and
compounding operator or the replacing and compounding operator, i.e. the expression on the seventh line or the eighth
line from the top on the right side in Exp. (37), was selected. When in the step S66 it is determined that a path
corresponding to the copying and compounding operator or the replacing and compounding operator was selected,
processing proceeds to a step S67, and because a provisionally inserted word iw_{n-1} made a modifier constituting a
compound word by the copying and compounding operator or the replacing and compounding operator is stored in the
cache 7A, this is erased and processing proceeds to the step S68.

When in the step S66 it is determined that neither of the paths corresponding to the copying and compounding
operator or the replacing and compounding operator was selected, processing skips the step S67 and proceeds to the
step S68 and it is determined whether or not in the calculation of the cumulative distance G(m, n) a path corresponding
to the scrambling operator scramble(ew_k, iw_n) or the scrambling operator scramble(ew_m, iw_s), i.e. the expression on the
fifth or the sixth line from the top on the right side in Exp. (37), was selected. When in the step S68 it is determined that
a path corresponding to the scrambling operator scramble(ew_k, iw_n) or the scrambling operator scramble(ew_m, iw_s) was
selected, processing proceeds to a step S69 wherein because a provisionally deleted word ew_k or a provisionally
inserted word iw_s is stored in the cache 7A this is erased, and then proceeds to a step S70.

When on the other hand in the step S68 it is determined that neither of the paths corresponding to the scrambling
operators scramble(ew_k, iw_n) or scramble(ew_m, iw_s) was selected, processing skips the step S69 and proceeds to the
step S70, wherein it is determined whether or not m is less than M (the number of words constituting the focus example
phrase E). When it is determined that m is less than M, processing returns to the step S61 and the converting part 7
increases m and n and repeats the subsequent processing. When in the step S70 it is determined that m is not less than
M, i.e. m is equal to M, processing proceeds to a step S71 and it is determined whether or not a provisionally deleted
word ew_m or a provisionally inserted word iw_n is stored in the cache 7A.

When in the step S71 it is determined that a provisionally deleted word ew_m or a provisionally inserted word iw_n is
stored in the cache 7A, processing proceeds to a step S72 and, among the paths of when the cumulative distance
G(M, n) was obtained, the path corresponding to the provisionally deleting operator cache-as-deleted(ew_m) having
provisionally deleted the provisionally deleted word ew_m stored in the cache 7A or the path corresponding to the
provisionally inserting operator cache-as-added(iw_n) having provisionally inserted the provisionally inserted word iw_n
stored in the cache 7A is respectively converted into the path corresponding to a word deleting operator
delete-word(ew_m) or a path corresponding to the word inserting operator add-word(iw_n) and the path obtained as a
result is taken as the optimum path, and processing proceeds to a step S73.

When in the step S71 it is determined that neither a provisionally deleted word ew_m nor a provisionally inserted
word iw_n is stored in the cache 7A, the path of when the cumulative distance G(M, n) was obtained is taken unchanged
as the optimum path, processing proceeds to the step S73 and the word transformation probability is calculated
following this optimum path.

In this case, the optimum path includes a path corresponding to the provisionally deleting operator
cache-as-deleted(ew_m) or the provisionally inserting operator cache-as-added(iw_n) respectively having provisionally
deleted or provisionally inserted the provisionally deleted word ew_m or the provisionally inserted word iw_n erased from
the cache 7A, but this path can be ignored in calculating the word transformation probability.
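The cache bookkeeping of steps S62 to S69 and the conversion of step S72 can be sketched as follows. The data representation is an assumption made for illustration: the cache 7A is modelled as a list of (kind, word) pairs, a path as a list of (operator, word) pairs, and words are taken to be unique within a phrase.

```python
def update_cache(cache, operator, ew=None, iw=None, cached=None):
    """Cache handling for one selected path (sketch of steps S62 to S69)."""
    if operator == "cache-as-deleted":        # S62 -> S63: store provisionally deleted word
        cache.append(("deleted", ew))
    elif operator == "cache-as-added":        # S64 -> S65: store provisionally inserted word
        cache.append(("added", iw))
    elif operator in ("echo-word+compound", "alter-word+compound", "scramble"):
        cache.remove(cached)                  # S67 / S69: the cached word has been consumed


def finalize_path(path, cache):
    """Step S72: provisional operators whose words remain cached become final ones."""
    remaining = set(cache)
    final = []
    for op, word in path:
        if op == "cache-as-deleted" and ("deleted", word) in remaining:
            final.append(("delete-word", word))
        elif op == "cache-as-added" and ("added", word) in remaining:
            final.append(("add-word", word))
        else:
            final.append((op, word))
    return final


cache = []
update_cache(cache, "cache-as-deleted", ew="kinou")
update_cache(cache, "cache-as-added", iw="watashi")
update_cache(cache, "scramble", cached=("deleted", "kinou"))
path = [("cache-as-deleted", "kinou"), ("cache-as-added", "watashi"), ("scramble", "kinou")]
optimum = finalize_path(path, cache)
```

In this example the provisionally deleted word is consumed by a scramble and so stays provisional, while the provisionally inserted word is still cached at the end and is therefore converted into a final insertion.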

As a result of the scrambling operator scramble(ew_m, iw_n) being introduced together with the provisionally deleting
operator cache-as-deleted(ew_m) and the provisionally inserting operator cache-as-added(iw_n), as described above,
even when there is a difference in word order between an input phrase I and an example phrase E which are similar, it
becomes possible to determine their similarity correctly.

Also, as a result of the compounding operator compound(iw_{n-1}, iw_n) being introduced, it becomes possible to handle
compound words as well.

A calculating method using memory means like the cache 7A for temporarily storing provisionally deleted words
and provisionally inserted words is applied to language models for voice recognition in for example "A Cache-based
Natural Language Model for Speech Recognition", Roland Kuhn and Renato De Mori, IEEE Transactions on Pattern
Analysis and Machine Intelligence, vol. 12, No. 6, June 1990.

Here, a language model based on the hypothesis that the probability of a recently-used word appearing (being
spoken) again is higher than the probability of that word first appearing is proposed, wherein recently-used words are
stored in memory means and the probability of an ith word W_i being a predetermined word W is calculated with
reference to this memory means.

That is, for example in an ordinary 3g-gram model, when the parts of speech of the (i-1)th and (i-2)th words are
respectively written g_{i-1} and g_{i-2}, the probability P(W_i=W | g_{i-2}, g_{i-1}) of the ith word W_i being a predetermined
word W is calculated for example using the following expression.

P(W_i = W | g_{i-2}, g_{i-1}) = Σ_{g_j ∈ Q} P(W_i = W | g_i = g_j) P(g_i = g_j | g_{i-2}, g_{i-1})    (38)

Here, Q is a set of parts of speech in the language that is the object of the voice recognition.
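The sum over parts of speech in Exp. (38) can be illustrated with a short sketch. The lookup tables and their values are toy assumptions introduced here; only the structure of the sum is taken from the expression.

```python
def word_probability(word, pos_i2, pos_i1, p_word_given_pos, p_pos_given_history):
    """Exp. (38): sum over parts of speech g_j of P(W|g_j) * P(g_j | g_{i-2}, g_{i-1})."""
    return sum(
        p_word_given_pos.get((word, pos), 0.0) * p_pos
        for pos, p_pos in p_pos_given_history[(pos_i2, pos_i1)].items()
    )


# Toy tables (hypothetical values).
p_pos_given_history = {("det", "adj"): {"noun": 0.8, "verb": 0.2}}
p_word_given_pos = {("dog", "noun"): 0.1, ("dog", "verb"): 0.01}

p_dog = word_probability("dog", "det", "adj", p_word_given_pos, p_pos_given_history)
```

Here the probability of "dog" after a determiner and an adjective is dominated by the noun term, since a noun is the most likely part of speech in that position.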

In the aforementioned literature, a method of calculating the P(W_i=W | g_i=g_j) on the right side of Exp. (38)
according to the following Exp. (39) by storing recently-used words in storing means is proposed.

P(W_i = W | g_i = g_j) = k_{M,j} × f(W_i = W | g_i = g_j) + k_{C,j} × C_j(W, i)    (39)

Here, k_{M,j} + k_{C,j} = 1, and k_{M,j} and k_{C,j} are both variables indexed only by j.

The C_j(W, i) on the right side of Exp. (39) is a probability calculated by referring to the memory means, and changes
depending on whether the word W has appeared recently.
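The interpolation of Exp. (39) can be sketched as below. The class and function names, the cache size and the use of relative frequency within the cache as C_j(W, i) are assumptions made for illustration; the description above states only that C_j(W, i) is calculated by referring to the memory means.

```python
from collections import Counter, deque


class RecentWordCache:
    """Memory means holding recently used words for one part of speech (sketch)."""

    def __init__(self, size=200):
        self.words = deque(maxlen=size)   # oldest words drop out automatically

    def add(self, word):
        self.words.append(word)

    def probability(self, word):
        """Assumed C_j(W, i): relative frequency of the word among cached words."""
        if not self.words:
            return 0.0
        return Counter(self.words)[word] / len(self.words)


def cached_word_probability(word, f_static, cache, k_m=0.8):
    """Exp. (39) with k_M,j = k_m and k_C,j = 1 - k_m."""
    k_c = 1.0 - k_m
    return k_m * f_static + k_c * cache.probability(word)


cache = RecentWordCache()
for w in ["dog", "cat", "dog"]:
    cache.add(w)
p_recent = cached_word_probability("dog", f_static=0.01, cache=cache)
p_unseen = cached_word_probability("bird", f_static=0.01, cache=cache)
```

A word that has appeared recently receives a higher probability than its static estimate alone, while a word absent from the cache receives a slightly lower one.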

Therefore, the means in the preferred embodiment shown in Fig. 15 has something in common with the means of
the aforementioned literature, wherein words are stored in memory means and probabilities are calculated with
reference to that, in the point that words are stored in the cache 7A and probabilities are calculated with reference to
this; however, it differs from the means of the aforementioned literature, wherein words having appeared recently are
stored in the storing means, in that only provisionally deleted words and provisionally inserted words are stored in the
cache 7A.

With the compounding operator compound(iw_{n-1}, iw_n), as well as it being possible to handle compound nouns of the
kind mentioned above, it is possible for example to handle such times as when the example phrase E is 'INU WA
KAWAII' ('dogs are cute') and the input phrase I is 'WATASHI NO INU WA KAWAII' ('my dog is cute') and a compound
word 'WATASHI NO INU' ('my dog') made up of the modifier 'WATASHI NO' ('my') added to the word 'INU' ('dog') of the
input phrase I corresponds to the word 'INU' of the example phrase E. However, in cases such as 'WATASHI NO INU'
('my dog') wherein it is clear that 'WATASHI NO' ('my') modifies 'INU' ('dog'), by applying the replacing operator
alter-word('INU', 'WATASHI NO INU') to the 'INU' of the example phrase E and the 'WATASHI NO INU' of the input phrase I
and using a noun phrase example 'WATASHI NO X' ('my X') (where X represents a noun), it is possible to process the
phrase correctly and more efficiently than when the compounding operator is used.

That is, the compounding operator compound(iw_{n-1}, iw_n) is effective in cases such as when a compound word of
which the modifying relationship is unclear, for example one not containing the (possessive) particle 'NO' (for example
a compound noun), is inputted into the inputting part 1, or when in a voice recognition device in the inputting part 1 a
particle is not recognized and for example 'WATASHI, INU WA KAWAII' is outputted from the inputting part 1 with
respect to an actual input of 'WATASHI NO INU WA KAWAII'.

In the above, a case wherein the invention is applied to a translating apparatus for carrying out translation from
Japanese into English was described, but the invention can also be applied to translating apparatuses for performing
translation between other languages.

It is possible to provide the translating apparatus with buttons for correcting and confirming translation results
outputted from the outputting part 16 (hereinafter for convenience referred to respectively as the correction button and
the confirmation button). The translating apparatus can be made to carry out subsequent processing when the
confirmation button is operated confirming that the translation result is correct. Also, the translating apparatus can be
made to carry out translation processing again when the correction button is operated indicating that the translation
result is wrong. That is, for example, the translating apparatus can be made to detect the example having the next
highest translation suitability and carry out translation on the basis of that example. Furthermore, it is also possible to
provide the translating apparatus with a control part for correcting translation results outputted from the outputting
part 16. In this case, when translation results have been corrected by the control part being operated, it is possible to
store examples corresponding to the results of this correction in the noun phrase translation example memory 8, the
verb phrase translation example memory 9 or the other translation example memory 10.

Although in the preferred embodiments described above a priori probability P(E_i) was found for each example E_i for
example according to Exp. (5) or as described above with reference to 11, for this priori probability P(E_i) it is also
possible to use the same value, for example the inverse 1/N_E of the number of examples N_E, for all the examples E_i.

In these preferred embodiments, in the similarity degree calculating part 11, the word replacement probability was
found on the basis of the number of words included in a concept common to the two words concerned, but it is also
possible for the word replacement probability to be found on the basis of the level of the class to which the concept
common to the two words belongs in the thesaurus, as in the related art. That is, when the number of classes of the
thesaurus is written C and the level of the class to which the concept common to a word iw_n of an input phrase I and a
word ew_m of an example phrase E belongs is written L, it is possible to find the word replacement probability
P(alter-word(ew_m, iw_n)) according to the following expression.

P(alter-word(ew_m, iw_n)) = [...]    (40)

Also, although in the preferred embodiments the word deletion probability and so on were determined on the basis
of the part of speech of the word (whether the word is an independent word or a functional word), depending
on the domain in which the translating apparatus is to be used there are cases in which it is effective to determine them
on the basis of information other than the part of speech. That is, it is possible to determine the word deletion
probability and so on taking into consideration factors such as that in spoken language pronouns of the first and second
person and so on tend to be omitted and that words conveying a topic in context tend [...] to the people talking.

Also, it is possible to give the word insertion probability and the word deletion probability a predetermined constant
value to apply in all cases, as shown in Exps. (41) and (42) respectively.

P(add-word(iw_n)) = add-word    (41)

P(delete-word(ew_m)) = delete-word    (42)

The word deletion probability and so on can also be given according to a predetermined calculation formula.

Although in the preferred embodiments translation was carried out in clause or phrase units, it is also possible for
translation to be carried out in other syntax units. It is also possible for translation to be carried out without the first
language sentence being divided into such syntax units. Furthermore, although in the preferred embodiments described
above the example phrases were stored classified into noun phrases, verb phrases and other phrases, the method of
classification is not limited to this.

As described above, in a translating apparatus and a translating method according to the invention, a first language
sentence is divided into syntax units consisting of predetermined units of sentence structure and examples similar to
these syntax units of the first language sentence are detected. Then, on the basis of these detected examples, the first
language sentence is translated in syntax units and the results of translation of the syntax units are compounded to
generate a second language sentence. Therefore, efficient translation processing becomes possible and as a result the
translation processing speed can be increased.

In another translating method according to the invention, the first language sentence is divided into syntax units
consisting of predetermined units of sentence structure, in stages from large syntax units into small syntax units, and
the syntax units obtained at each stage are collated with examples, and examples corresponding to the syntax units are
detected on the basis of the results of this collation. The first language sentence is then translated in syntax units on
the basis of translations corresponding to these detected examples, and the results of translation of the syntax units are
compounded to generate a second language sentence. Therefore, efficient translation processing becomes possible
and as a result the translation processing speed can be increased.
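The staged division from large syntax units into small ones can be pictured with the following sketch. It makes several simplifying assumptions that are not taken from the source: examples are looked up by exact match rather than by similarity, the hypothetical split function divides a unit at the particle 'WA', and the partial translations are compounded by simple concatenation.

```python
def translate_unit(unit, examples, split):
    """Translate a syntax unit, dividing it further only when no example matches."""
    if unit in examples:                 # an example exists for this (large) unit
        return examples[unit]
    parts = split(unit)                  # otherwise divide into smaller syntax units
    if len(parts) <= 1:
        return unit                      # cannot divide further: left as-is in this sketch
    return " ".join(translate_unit(p, examples, split) for p in parts)


# Hypothetical example memory and dividing rule.
examples = {"watashi no inu": "my dog", "kawaii": "is cute"}
split_at_wa = lambda unit: unit.split(" wa ")

result = translate_unit("watashi no inu wa kawaii", examples, split_at_wa)
```

Because a match at a large unit ends the recursion early, smaller units are examined only when needed, which is one way of reading the efficiency claimed above.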

In another translating apparatus and translating method according to the invention, a degree of similarity
expressing the similarity between a word of a first language sentence and a word of an example is calculated on the
basis of the number of words included in a concept common to the word of the first language sentence and the word of
the example in a thesaurus wherein words are classified in a tree structure on the basis of their concepts. Therefore, it
becomes possible to calculate an accurate degree of similarity irrespective of the structure of the thesaurus and as a
result the translation processing speed can be increased.

In another translating apparatus and another translating method according to the invention, an example similar to
a first language sentence is detected on the basis of a priori probability of the example being used to translate the first
language sentence and a transformation probability of the example being transformed into the first language sentence
and used, and the first language sentence is translated on the basis of a translation corresponding to the detected
example. Therefore, it is possible to perform translation processing which is [...]
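Detection on the basis of a priori probability and a transformation probability can be sketched as choosing the example that maximizes their product. Treating the two as a product, the uniform prior and the toy transformation probability based on shared words are all assumptions made here for illustration.

```python
def detect_example(syntax_unit, examples, prior, transformation_probability):
    """Return the example maximizing prior(E) * P(E transformed into the syntax unit)."""
    return max(examples, key=lambda e: prior(e) * transformation_probability(e, syntax_unit))


examples = ["inu wa kawaii", "neko wa kawaii"]


def uniform_prior(example):
    # Same value for every example: the inverse of the number of examples.
    return 1.0 / len(examples)


def toy_transformation_probability(example, unit):
    # Hypothetical stand-in: fraction of the unit's words that the example shares.
    unit_words = unit.split()
    shared = len(set(example.split()) & set(unit_words))
    return shared / len(unit_words)


best = detect_example("watashi no inu wa kawaii", examples,
                      uniform_prior, toy_transformation_probability)
```

With a uniform prior the choice is decided entirely by the transformation probability; a non-uniform prior would bias detection toward examples that are used more often.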

In another translating apparatus and translating method according to the invention, by syntax analysis being
performed on a first language sentence, grammatical attributes of the sentence are extracted and an example similar to
the first language sentence is detected taking into account grammatical attributes of the first language sentence and of
the example. The first language sentence is then translated on the basis of a translation corresponding to the detected
example. Therefore, it becomes unnecessary to store multiple superficially different examples having the same meaning
content.

Various other modifications and application examples of the invention are conceivable within the scope of the
invention. Therefore, the invention is not limited to the preferred embodiments.

Claims 

1. A translating apparatus for translating a first language sentence expressed in a first language into a second
language sentence expressed in a second language, comprising:

example memory means (3) storing examples expressed in said first language and corresponding translations
of said examples in said second language;

dividing means (4) for dividing said first language sentence into syntax units consisting of predetermined units
of sentence structure;

detecting means (6) for detecting for each of said syntax units of said first language sentence outputted by said
dividing means an example similar to that syntax unit from among said examples;

translating means (7) for translating said first language sentence in said syntax units on the basis of those of
said translations corresponding to those of said examples detected by said detecting means (6); and

generating means (15) for generating said second language sentence by compounding results of translation of
said syntax units obtained by said translating means.

2. A translating apparatus according to claim 1, wherein said example memory means (3) stores said examples and
said corresponding translations in said syntax units.

3. A translating apparatus according to claim 1 or 2, wherein said dividing means (4) divides said first language
sentence in stages from large syntax units to small syntax units.

4. A translating apparatus according to claim 3, wherein said dividing means (4) comprises clause dividing means for
dividing said first language sentence into clause units and phrase dividing means for dividing said clause units of
said first language sentence outputted from said clause dividing means into phrase units.

5. A translating apparatus according to claim 4, wherein said example memory means (3) comprises clause unit
example memory means (5) storing examples in clause units and

phrase unit example memory means storing phrase examples in phrase units.

6. A translating apparatus according to claim 5, wherein said phrase unit example memory means (3) stores said
examples classified at least into noun phrases and verb phrases.

7. A translating apparatus according to claim 3, wherein said dividing means (4) divides said first language sentence
into large syntax units and when none of said examples matches any of said large syntax units divides said large
syntax units into smaller syntax units.

8. A translating apparatus according to claim 1, wherein said example memory means (3) stores said examples
together with grammatical attributes thereof.

9. A translating apparatus according to claim 8, wherein said dividing means (4) divides said first language sentence
into said syntax units and extracts grammatical attributes of said syntax units by carrying out syntax analysis and
said detecting means (6) detects those of said examples similar to said clause units taking into account
grammatical attributes of said syntax units and said examples.

10. A translating apparatus according to any one of the preceding claims, wherein said detecting means (6) detects said
example similar to said syntax unit on the basis of a priori probability of said example being used to translate said
syntax unit and a transformation probability of said example being transformed into said syntax unit and used.

11. A translating apparatus according to claim 10, wherein said transformation probability is calculated from a
grammatical attribute transformation probability of grammatical attributes of said example being transformed and a
word transformation probability of words of said example being transformed.

12. A translating apparatus according to claim 11, wherein said grammatical attribute transformation probability is
calculated from a grammatical attribute copy probability of a grammatical attribute of said example being used
unchanged, a grammatical attribute deletion probability of a grammatical attribute of said example being deleted, a
grammatical attribute addition probability of a grammatical attribute of said syntax unit being added, and a
grammatical attribute replacement probability of a grammatical attribute of said example being replaced with a
grammatical attribute of said syntax unit.
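The four grammatical attribute probabilities recited above can be combined as in the following sketch. It assumes that grammatical attributes are represented as slot/value pairs, that the four probabilities are fixed constants, and that they are multiplied together; the representation, the numeric values, and the function name are all assumptions introduced for illustration.

```python
from typing import Dict


def attribute_transformation_probability(example_attrs: Dict[str, str],
                                         unit_attrs: Dict[str, str],
                                         p_copy: float = 0.7,
                                         p_delete: float = 0.1,
                                         p_add: float = 0.1,
                                         p_replace: float = 0.1) -> float:
    """Multiply one copy/delete/add/replace probability per attribute slot."""
    prob = 1.0
    for slot in set(example_attrs) | set(unit_attrs):
        if slot not in unit_attrs:
            prob *= p_delete              # attribute of the example is deleted
        elif slot not in example_attrs:
            prob *= p_add                 # attribute of the syntax unit is added
        elif example_attrs[slot] == unit_attrs[slot]:
            prob *= p_copy                # attribute is used unchanged
        else:
            prob *= p_replace             # attribute is replaced
    return prob


example_attrs = {"tense": "past", "voice": "active"}
unit_attrs = {"tense": "present", "voice": "active", "mood": "interrogative"}
p = attribute_transformation_probability(example_attrs, unit_attrs)
```

Here one attribute is replaced, one is copied, and one is added, so the result is the product of the corresponding three probabilities.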

13. A translating apparatus according to claim 11, wherein said word transformation probability is calculated from a
word copy probability of a word of said example being used unchanged, a word deletion probability of a word of
said example being deleted, a word addition probability of a word of said syntax unit being added, and a word
replacement probability of a word of said example being replaced with a word of said syntax unit.

14. A translating apparatus according to claim 13, further comprising:

thesaurus memory means (12) storing a thesaurus wherein words are classified in a tree structure on the basis
of their concepts; and

word similarity degree calculating means (11) for calculating a word similarity degree expressing a similarity
between a word of said syntax unit and a word of said example on the basis of said thesaurus (12) and outputting
said word similarity degree as said word replacement probability.

15. A translating apparatus according to claim 14, wherein said word similarity degree calculating means (11) calculates
said word similarity degree on the basis of the number of words included in a concept common to said word
of said syntax unit and said word of said example in said thesaurus.
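A word similarity degree based on the number of words included in a common concept can be sketched as follows. The sketch assumes a thesaurus stored as a mapping from each concept to its children, and takes the similarity to be the reciprocal of the number of words under the smallest concept containing both words; that particular formula, the toy thesaurus, and all names are assumptions made for illustration.

```python
from typing import Dict, List, Optional, Set

Thesaurus = Dict[str, List[str]]  # concept -> child concepts or words (leaves)


def words_under(thesaurus: Thesaurus, node: str) -> Set[str]:
    """Collect every leaf word classified under a node of the tree."""
    children = thesaurus.get(node)
    if children is None:                  # a leaf is a word, not a concept
        return {node}
    found: Set[str] = set()
    for child in children:
        found |= words_under(thesaurus, child)
    return found


def smallest_common_concept_size(thesaurus: Thesaurus, root: str,
                                 w1: str, w2: str) -> Optional[int]:
    """Number of words in the smallest concept that contains both words."""
    best: Optional[int] = None
    stack = [root]
    while stack:
        node = stack.pop()
        if node not in thesaurus:
            continue
        words = words_under(thesaurus, node)
        if w1 in words and w2 in words:
            if best is None or len(words) < best:
                best = len(words)
            stack.extend(thesaurus[node])  # a narrower concept may also hold both
    return best


def word_similarity(thesaurus: Thesaurus, root: str, w1: str, w2: str) -> float:
    """Assumed formula: fewer words in the common concept means higher similarity."""
    size = smallest_common_concept_size(thesaurus, root, w1, w2)
    return 0.0 if size is None else 1.0 / size


toy = {"thing": ["drink", "vehicle"],
       "drink": ["tea", "coffee"],
       "vehicle": ["car", "bus", "train"]}
```

Because only word counts are used, this kind of measure does not require the thesaurus to be regularly structured or to have uniform distances between concepts.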

16. A translating apparatus according to claim 11, wherein said word transformation probability is calculated according
to an optimum transformation path for transforming said example into said syntax unit.

17. A translating apparatus according to claim 16, wherein said optimum transformation path is found by dynamic
programming.
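Finding an optimum transformation path by dynamic programming can be illustrated with the following sketch, which resembles a probabilistic edit-distance computation over word sequences. It covers only the copy, delete, add, and replace operations, uses constant illustrative probabilities that are not normalized, and works in log space; the function names and parameter values are assumptions and not taken from the specification.

```python
import math
from typing import Callable, List, Sequence, Tuple


def _log(p: float) -> float:
    """Natural log that maps a zero probability to -inf instead of raising."""
    return math.log(p) if p > 0.0 else float("-inf")


def optimum_path(example: Sequence[str], unit: Sequence[str],
                 p_copy: float, p_delete: float, p_add: float,
                 p_replace: Callable[[str, str], float]) -> Tuple[float, List[str]]:
    """Return the probability and operations of the most probable path."""
    n, m = len(example), len(unit)
    score = [[float("-inf")] * (m + 1) for _ in range(n + 1)]
    back = [[""] * (m + 1) for _ in range(n + 1)]
    score[0][0] = 0.0
    for i in range(n + 1):
        for j in range(m + 1):
            if i == 0 and j == 0:
                continue
            candidates = []
            if i > 0:   # delete a word of the example
                candidates.append((score[i - 1][j] + _log(p_delete), "delete"))
            if j > 0:   # add a word of the syntax unit
                candidates.append((score[i][j - 1] + _log(p_add), "add"))
            if i > 0 and j > 0:
                if example[i - 1] == unit[j - 1]:
                    step, op = _log(p_copy), "copy"
                else:
                    step, op = _log(p_replace(example[i - 1], unit[j - 1])), "replace"
                candidates.append((score[i - 1][j - 1] + step, op))
            score[i][j], back[i][j] = max(candidates)
    ops: List[str] = []
    i, j = n, m
    while i > 0 or j > 0:                 # trace the best path back to the start
        op = back[i][j]
        ops.append(op)
        if op == "delete":
            i -= 1
        elif op == "add":
            j -= 1
        else:
            i, j = i - 1, j - 1
    ops.reverse()
    return math.exp(score[n][m]), ops


prob, ops = optimum_path(["I", "drink", "tea"], ["I", "drink", "coffee"],
                         p_copy=0.8, p_delete=0.05, p_add=0.05,
                         p_replace=lambda a, b: 0.1)
```

In a fuller model the constant replacement probability would be supplied per word pair, for example by a thesaurus-based word similarity degree.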

18. A translating apparatus according to claim 13, wherein said word transformation probability is calculated further
using at least one among a word scramble probability of a word of said example unchanged or replaced with a word
of said syntax unit being used with its position changed, a word copy and compound probability of a word of said
example having a modifier added thereto and being used as a compound word, and a word replace and compound
probability of a word of said example replaced with a word of said syntax unit having a modifier added thereto and
being used as a compound word.

19. A translating apparatus according to claim 18, wherein said word transformation probability is calculated according
to an optimum transformation path for transforming said example into said syntax unit.

20. A translating apparatus according to claim 19, wherein in finding said optimum transformation path instead of an
actual deletion path corresponding to actually deleting a word of said example a provisional deletion path provisionally
deleting a word of said example is used and instead of an actual addition path corresponding to actually adding
a word of said syntax unit a provisional addition path provisionally adding a word of said syntax unit is used,

further comprising word memory means for storing a provisionally deleted word and a provisionally added
word provisionally deleted or added according to said provisional deletion path or said provisional addition path.

21. A translating apparatus according to claim 20, wherein a scramble path corresponding to using a word of said
example unchanged or replaced with a word of said syntax unit with its position changed is selected only when
either a word of said example has been stored in said word memory means as said provisionally deleted word or a
word of said syntax unit has been stored in said word memory means as said provisionally added word, and thereafter
said word of said example stored as said provisionally deleted word or said word of said syntax unit stored as
said provisionally added word is erased from said word memory means, and

a copy and compound path corresponding to a word of said example having a modifier added thereto and
being used as a compound word and a replace and compound path corresponding to a word of said example
replaced with a word of said syntax unit having a modifier added to it and being used as a compound word are
selected only when a word to become said modifier has been stored in said word memory means as said provisionally
added word, and thereafter said word to become a modifier stored as said provisionally added word is erased
from said word memory means.

22. A translating apparatus according to claim 21, wherein after the end point of a transformation path transforming
said example into said syntax unit is reached a provisional deletion path or a provisional adding path provisionally
deleting or adding said provisionally deleted word or said provisionally added word stored in said word memory
means is converted into an actual deleting path or an actual adding path respectively and the path obtained is
made said optimum transformation path.

23. A translating apparatus according to claim 10, wherein said priori probability or said transformation probability is
calculated on the basis of corpus data.

24. A translating apparatus according to claim 10, wherein said priori probability or said transformation probability is
calculated on the basis of said examples stored in said example memory means (3).

25. A translating method for translating a first language sentence expressed in a first language into a second language
sentence expressed in a second language using example memory means (3) storing examples expressed in said
first language and corresponding translations of said examples in said second language, comprising the steps of:

dividing (4) said first language sentence into syntax units consisting of predetermined units of sentence structure;

detecting (6) for each of said syntax units of said first language sentence an example similar to that syntax unit
from among said examples;

translating (7) said first language sentence in said syntax units on the basis of those of said translations
corresponding to the detected examples; and

generating (15) said second language sentence by compounding results of translation of said syntax units.
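The four steps of the method above (dividing, detecting, translating, generating) can be strung together as in the following minimal sketch. It assumes clauses separated by commas as the syntax units and uses the `difflib.SequenceMatcher` ratio over word lists as a stand-in similarity measure. It also simply reuses the stored translation of the detected example, without adapting the words that differ, whereas the claimed translating step works on the basis of that translation. All names and the sample data are hypothetical.

```python
import difflib
from typing import Dict, List


def divide(sentence: str) -> List[str]:
    """Dividing step: split the sentence into clause-like syntax units."""
    return [part.strip() for part in sentence.split(",") if part.strip()]


def detect(unit: str, examples: Dict[str, str]) -> str:
    """Detecting step: pick the stored example most similar to the unit."""
    def similarity(example: str) -> float:
        return difflib.SequenceMatcher(None, example.split(), unit.split()).ratio()
    return max(examples, key=similarity)


def translate_unit(unit: str, examples: Dict[str, str]) -> str:
    """Translating step (simplified): reuse the detected example's translation."""
    return examples[detect(unit, examples)]


def generate(sentence: str, examples: Dict[str, str]) -> str:
    """Generating step: compound the unit translations into one sentence."""
    return ", ".join(translate_unit(u, examples) for u in divide(sentence))


examples = {"when I wake up": "quand je me reveille",
            "I drink tea": "je bois du the"}
units = divide("when I get up, I drink coffee")
detected = [detect(u, examples) for u in units]
```

In this toy run the unit "I drink coffee" is paired with the example "I drink tea"; a complete translating step would then replace the differing word in the example's translation.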

26. A translating method for translating a first language sentence expressed in a first language into a second language
sentence expressed in a second language using example memory means (3,5) storing examples expressed in said
first language and corresponding translations of said examples in said second language, comprising the steps of:

dividing (4) said first language sentence into syntax units consisting of predetermined units of sentence structure
in stages from large syntax units into small syntax units and collating syntax units obtained at each stage
with said examples;

detecting (6) for each of said syntax units an example corresponding to that syntax unit from among said examples
on the basis of results of the collation;

translating (7) said first language sentence in said syntax units on the basis of those of said translations
corresponding to the detected examples; and

generating (15) said second language sentence by compounding results of translation of said syntax units.

27. A translating apparatus for translating a first language sentence expressed in a first language into a second
language sentence expressed in a second language, comprising:

example memory means (3,5) storing examples expressed in said first language and corresponding translations
of said examples in said second language;

thesaurus memory means (12) storing a thesaurus wherein words are classified in a tree structure on the basis
of their concepts; and

calculating means (11) for calculating word similarity degrees each expressing the similarity between a word
of said first language sentence and a word of one of said examples on the basis of said thesaurus (12);

detecting means (6) for detecting from among said examples an example similar to said first language sentence
on the basis of said word similarity degrees calculated by said calculating means;

translating means (7) for translating said first language sentence on the basis of a translation among said
translations corresponding to said example detected by said detecting means,

wherein said calculating means (11) calculates each of said word similarity degrees on the basis of the
number of words included in a concept common to a word of said first language sentence and a word of one of
said examples in said thesaurus.

28. A translating method for translating a first language sentence expressed in a first language into a second language
sentence expressed in a second language using example memory means (3,5) storing examples expressed in said
first language and corresponding translations of said examples in said second language, comprising the steps of:

calculating (11) word similarity degrees each expressing the similarity between a word of said first language
sentence and a word of one of said examples on the basis of a thesaurus (12) wherein words are classified in
a tree structure on the basis of their concepts;

detecting (6) from among said examples an example similar to said first language sentence on the basis of said
word similarity degrees; and

translating (7) said first language sentence on the basis of a translation among said translations corresponding
to said example detected;

wherein each of said word similarity degrees is calculated on the basis of the number of words included
in a concept common to a word of said first language sentence and a word of one of said examples in said
thesaurus (12).

29. A translating apparatus for translating a first language sentence expressed in a first language into a second
language sentence expressed in a second language, comprising:

example memory means (3,5) storing examples expressed in said first language and corresponding translations
of said examples in said second language;

detecting means (6) for detecting from among said examples an example similar to said first language sentence
on the basis of a priori probability of said example being used for translating said first language sentence
and a transformation probability of said example being transformed into said first language sentence and used;
and

translating means (7) for translating said first language sentence on the basis of a translation among said
translations corresponding to said example detected by said detecting means (6).

30. A translating apparatus according to claim 29, wherein said transformation probability is calculated from a
grammatical attribute transformation probability of grammatical attributes of said example being transformed and a word
transformation probability of words of said example being transformed.

31. A translating apparatus according to claim 30, wherein said grammatical attribute transformation probability is
calculated from a grammatical attribute copy probability of a grammatical attribute of said example being used
unchanged, a grammatical attribute deletion probability of a grammatical attribute of said example being deleted, a
grammatical attribute addition probability of a grammatical attribute of said first language sentence being added,
and a grammatical attribute replacement probability of a grammatical attribute of said example being replaced with
a grammatical attribute of said first language sentence.

32. A translating apparatus according to claim 30, wherein said word transformation probability is calculated from a
word copy probability of a word of said example being used unchanged, a word deletion probability of a word of
said example being deleted, a word addition probability of a word of said first language sentence being added, and
a word replacement probability of a word of said example being replaced with a word of said first language
sentence.



33. A translating apparatus according to claim 29, wherein said word transformation probability is calculated according
to an optimum transformation path for transforming said example into said first language sentence.

34. A translating apparatus according to claim 33, wherein said optimum transformation path is found by dynamic
programming.

35. A translating apparatus according to claim 32, wherein said word transformation probability is calculated further
using at least one among a word scramble probability of a word of said example unchanged or replaced with a word
of said first language sentence being used with its position changed, a word copy and compound probability of a
word of said example having a modifier added thereto and being used as a compound word, and a word replace
and compound probability of a word of said example replaced with a word of said first language sentence having a
modifier added thereto and being used as a compound word.

36. A translating apparatus according to claim 32, wherein said word transformation probability is calculated according
to an optimum transformation path for transforming said example into said first language sentence.

37. A translating apparatus according to claim 36, wherein in finding said optimum transformation path instead of an
actual deletion path corresponding to actually deleting a word of said example a provisional deletion path provisionally
deleting a word of said example is used and instead of an actual addition path corresponding to actually adding
a word of said first language sentence a provisional addition path provisionally adding a word of said first language
sentence is used,

further comprising word memory means for storing a provisionally deleted word and a provisionally added
word provisionally deleted or added according to said provisional deletion path or said provisional addition path.

38. A translating apparatus according to claim 37, wherein a scramble path corresponding to using a word of said
example unchanged or replaced with a word of said first language sentence with its position changed is selected
only when either a word of said example has been stored in said word memory means as said provisionally deleted
word or a word of said first language sentence has been stored in said word memory means as said provisionally
added word, and thereafter said word of said example stored as said provisionally deleted word or said word of said
syntax unit stored as said provisionally added word is erased from said word memory means, and

a copy and compound path corresponding to a word of said example having a modifier added thereto and
being used as a compound word and a replace and compound path corresponding to a word of said example
replaced with a word of said first language sentence having a modifier added to it and being used as a compound
word are selected only when a word to become said modifier has been stored in said word memory means as said
provisionally added word, and thereafter said word to become a modifier stored as said provisionally added word
is erased from said word memory means.

39. A translating apparatus according to claim 38, wherein after the end point of a transformation path transforming
said example into said first language sentence is reached a provisional deletion path or a provisional addition path
provisionally deleting or adding said provisionally deleted word or said provisionally added word stored in said word
memory means is converted into an actual deleting path or an actual addition path respectively and the path
obtained is made the optimum transformation path.

40. A translating method for translating a first language sentence expressed in a first language into a second language
sentence expressed in a second language using example memory means (3,5) storing examples expressed in said
first language and corresponding translations of said examples in said second language, comprising the steps of:

detecting (6) from among said examples an example similar to said first language sentence on the basis of a
priori probability of said example being used for translating said first language sentence and a transformation
probability of said example being transformed into said first language sentence and used; and

translating (7) said first language sentence on the basis of a translation among said translations corresponding
to said example detected.

41. A translating apparatus for translating a first language sentence expressed in a first language into a second lan- 
guage sentence expressed in a second language, comprising: 

example memory means (3,5) storing examples expressed in said first language and corresponding translations
of said examples in said second language together with grammatical attributes of said examples;

extracting means for extracting grammatical attributes of said first language sentence by carrying out syntax
analysis of said first language sentence;

detecting means (6) for detecting among said examples an example similar to said first language sentence taking
into account grammatical attributes of said first language sentence and of said examples; and

translating means (7) for translating said first language sentence on the basis of a translation among said
translations corresponding to said example detected by said detecting means.

42. A translating method for translating a first language sentence expressed in a first language into a second language 
sentence expressed in a second language using example memory means storing examples expressed in said first 
language and corresponding translations of said examples in said second language and grammatical attributes of
said examples, comprising the steps of:

extracting grammatical attributes of said first language sentence by carrying out syntax analysis of said first
language sentence;

detecting (6) among said examples an example similar to said first language sentence taking into account
grammatical attributes of said first language sentence and of said examples; and

translating (7) said first language sentence on the basis of a translation among said translations which
corresponds to said example detected.